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conducting  research  using  animals,  the  investigator (s) 
adhered  to  the  "Guide  for  the  Care  and  Use  of  Laboratory 
Animals,"  prepared  by  the  Committee  on  Care  and  use  of  Laboratory 
Animals  of  the  Institute  of  Laboratory  Resources,  national 
Research  Council  (NIH  Publication  No.  86-23,  Revised  1985) . 

f\l h'  For  the  protection  of  human  subjects,  the  investigator  (s) 
adhered  to  policies  of  applicable  Federal  Law  45  CFR  46. 

In  conducting  research  utilizing  recombinant  DNA  technology, 
the  investigator ( s )  adhered  to  current  guidelines  promulgateo  by 
the  National  Institutes  of  Health. 

I\j In  the  conduct  of  research  utilizing  recombinant  DNA,  the 
IH^stigator{s)  adhered  to  the  NIH  Guidelines  for  Research 
Involving  Recombinant  DNA  Molecules. 

/l//K  In  the  conduct  of  research  involving  hazardous  organisms, 
the  investigator  (s.)  adhered  to  the  CDC-NIH  Guide  for  Biosafety  in 
Microbiological  and  Biomedical  Laboratories. 
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Overview 


This  progress  report  describes  research  accomplished  during  the  period  December  31,  1996 
-  January  30,  1998.  The  manuscript  describing  the  method  for  correcting  for  measurement 
error  when  subjects  have  a  variable  number  of  repeated  measurements  of  exposure  was  pub¬ 
lished  in  The  American  Journal  of  Epidemiology  (Kim,  M.Y.  and  Zeleniuch-  Jacquotte, 
A.  “Correcting  for  measurement  error  in  the  analysis  of  case-control  data  with  repeated 
measurements  of  exposure”,  American  Journal  of  Epidemiology  (1997)  145:1003-1010.)  In 
addition,  a  paper  describing  the  results  of  the  application  of  the  technique  to  a  study  of 
repeated  measurements  of  androgens  and  risk  of  breast  cancer  in  postmenopausal  women 
appeared  in  the  same  journal.  (Zeleniuch- Jacquotte,  A.,  Bruning,  P.,  Bonfrer,  J.,  Koenig, 
K.,  Shore,  R.,  Kim,  M.,  Pastenack,  B.,  Toniolo,  P.  “Relation  of  serum  levels  of  testosterone 
and  dehydroepiandrosterone  sulfate  with  risk  of  breast  cancer  in  postmenopausal  women,” 
American  Journal  of  Epidemiology  (1997)  145:1030-1038.)  Reprints  of  the  two  articles  are 
provided  in  the  Appendix. 

A  method  for  analyzing  correlated  panel  data  was  developed  and  a  manuscript  based 
on  the  results  has  been  accepted  by  Biometrics.  Correlated  panel  data  can  arise  in  stud¬ 
ies  where  two  or  more  correlated  multi-state  processes  are  periodically  observed  on  each 
individual  and  the  exact  transition  times  between  states  are  unknown.  The  data  for  a  par¬ 
ticular  patient  consist  of  repeated  assessments  over  time  of  the  states  of  each  of  several 
possibly  correlated  clinical  variables.  A  procedure  which  models  each  process  marginally 
under  a  time-homogeneous  Markov  model  allowing  for  covariates  was  proposed.  Although 
this  technique  was  applied  to  data  from  an  AIDS  study,  the  method  is  also  applicable  to  the 
analysis  of  toxicity  data  from  clinical  trials  involving  the  treatment  of  breast  cancer  patients. 
Specifically,  one  can  utilize  the  technique  to  simultaneously  assess  whether  an  experimental 
treatment  for  breast  cancer  has  a  greater  toxic  effect  on  several  clinical  variables  compared 
with  a  standard  treatment. 

We  also  continued  work  on  the  method  for  adjusting  for  the  systematic  variability  of 
hormone  levels  over  the  menstural  cycle,  based  on  a  mixed  ANOVA  model  with  cubic 
splines.  The  method  standardizes  hormone  mecisurements  obtained  at  different  time  during 
the  menstrual  cycle,  thus  allowing  for  more  valid  comparisons  of  hormone  levels  between 
premenopausal  breast  cancer  cases  and  controls.  We  have  applied  the  method  to  adjust  the 
hormone  levels  of  premenopausal  subjects  in  a  nested  case-control  study  of  total  estradiol 
and  breast  cancer  risk.  However,  because  of  coding  problems  with  the  day-of-cycle  data 
which  subsequently  became  apparent,  the  methods  will  be  re-applied  to  a  revised  data  set. 
A  bootstrap  method  for  obtaining  variances  of  the  logistic  regression  parameters  which  takes 
into  account  the  error  associated  with  estimation  of  the  calibration  curve  has  been  developed 
but  not  yet  implemented. 


Chapter  I 


Adjusting  Hormone  Levels  for  Day  of  Men¬ 
strual  Cycle  in  Studies  of  Breast  Cancer  and 
Hormones  in  Pre-Menopausal  Women 


1  Introduction 


Although  levels  of  prolactin  and  bioavailable  estradiol  appear  to  be  relatively  stable  over 
the  phases  of  a  woman’s  menstrual  cycle,  other  hormones,  such  as  total  estradiol,  fluctuate 
considerably.  (Toniolo  et  ah,  1993;  Koenig  et  ah,  1993.;  Wu  et  ah,  1976;  Takatani  et  ah, 
1991).  Epidemiologic  studies  investigating  the  association  of  total  estradiol  and  risk  of  breast 
cancer  among  premenopausal  women  must  adjust  a  subject’s  hormone  level  for  day  of  cycle 
either  in  the  design  or  analysis  stage  of  the  study,  in  order  for  the  comparisons  between  cases 
and  controls  to  be  valid. 

In  the  NYU  Women’s  Health  Study,  a  nested  case-control  study  of  serum  hormonal  levels 
and  breast  cancer,  one  of  the  criteria  for  matching  controls  with  a  breast  cancer  case  among 
pre-menopausal  women  was  the  day  of  menstrual  cycle  on  which  the  first  blood  specimen  was 
collected,  measured  in  number  of  days  prior  to  next  expected  onset  of  menses.  Subsequent 
blood  donations,  however,  could  not  be  matched  on  day  of  cycle.  Therefore,  a  method 
was  needed  to  standardize  hormone  measurements  obtained  at  different  times  during  the 
menstrual  cycle  for  subjects  in  the  same  matched  set. 

Rosenberg  et  al  (1994)  used  the  first  measurement  from  each  control  subject  to  fit  a  three- 
piece  spline  model  to  describe  the  change  in  total  estradiol  level  over  the  menstrual  cycle. 
For  each  subject,  the  estradiol  measurement  adjusted  for  day  of  cycle  was  then  calculated 
as  the  difference  between  the  observed  value  and  the  expected  value  from  this  calibration 
curve,  measured  in  units  of  standard  deviation.  The  limitation  with  this  approach,  however, 
is  that  because  only  the  first  measurement  from  each  subject  was  used  to  fit  the  calibration 
curve,  the  curve  is  estimated  with  less  precision  than  one  that  is  estimated  using  all  available 
repeated  measurements.  In  addition,  the  width  of  the  confidence  intervals  for  the  relative 
risks  for  breast  cancer  based  on  the  adjusted  estradiol  measurements  are  underestimated, 
since  they  do  not  take  into  account  the  extra  variation  due  to  estimation  of  the  parameters 
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of  the  calibration  curve. 

We  propose  an  alternative  method  for  describing  the  within- subject  change  in  estradiol 
levels  over  the  menstrual  cycle,  based  on  a  mixed  linear  model  with  cubic  splines,  which 
utilizes  all  the  repeated  measurement  data  for  each  subject.  The  use  of  cubic  splines  in  the 
model  yields  a  smoother  curve  than  the  one  fit  by  Rosenberg  et  al,  which  was  based  on  a 
three-piece  spline:  two  parabolas  and  a  straight  line,  without  smoothed  join  points. 

We  use  the  results  from  the  mixed  linear  model  to  adjust  each  subject’s  hormone  level  for 
day-of-cycle.  The  adjusted  measurement  then  becomes  the  exposure  in  a  conditional  logistic 
regression  analysis.  Bootstrap  methods  are  utilized  to  obtain  estimates  of  the  corresponding 
95%  confidence  intervals  for  the  regression  coefficients  which  account  for  the  variation  in  the 
estimated  calibration  curve. 

2  Methods 

Let  y,-  =  {Viii  --,yiki}  denote  the  vector  of  hormone  levels  for  the  woman  mecisured  on  A;,- 
occasions  for  i  =  1,  ...n.  Furthermore,  let  t,-  =  {tn,  denote  the  vector  of  the  number 

of  days  prior  to  next  menses  at  which  the  y,  were  measured.  We  assume  a  mixed  linear 
model  of  the  form 

Vij  ~  "h  "h  "h  ^xj 

where  //  denotes  an  overall  mean,  a,  denotes  a  random  subject  effect  from  a  distri¬ 

bution,  S{tij)  is  a  cubic  spline  function,  and  the  e,j  are  independent  errors  from  a  N{0,crl) 
distribution.  We  further  assume  that  the  subject  effects  and  the  error  terms  are  mutually 
independent.  The  above  model  implies  that  the  correlation  between  repeated  measurements 
of  hormones  on  the  same  subject  is  equal  to  -f  cr^). 

We  chose  to  use  cubic  splines  to  model  estradiol  levels  versus  day  of  cycle  because  this 
method  provides  great  flexibility  in  fitting  models,  is  visually  smoth,  and  requires  fewer 
constants  to  fit  than  higher  degree  splines.  Rosenberg  et  al  utilized  two  parabolic  and  one 
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linear  function  to  describe  the  change  in  estradiol  over  the  menstrual  cycle,  with  only  a 
single  continuity  restriction.  Thus,  although  their  overall  function  was  continuous,  it  was 
not  smooth  at  the  two  join  points. 

When  fitting  a  cubic  spline  model,  more  join  points  or  knots  are  better  if  the  variable 
changes  quickly  over  the  covariate  space.  However,  too  many  knots  can  lead  to  over-fitting 
of  the  data  and  more  parameters  to  estimate.  Stone  (1986)  suggested  that  5  knots  should 
provide  enough  flexibility  for  a  reasonable  number  of  degrees  of  freedom. 

Given  that  the  average  length  of  a  menstrual  cycle  is  28  days,  we  positioned  5  knots  at 
the  5  day  intervals:  5,  10,  15,  20,  and  25  days  prior  to  next  menses.  Using  the  +  notation 
of  Smith  (1979),  let 

—  u  if  u  >  0 
=  0  if  u  <  0. 

Then  the  cubic  spline  can  be  specified  as: 

S{t)  =  ySo  +  +  02^"^  +  03^^  +  ~  5)^  ^5(f  —  10)^ 

+  -  15)^  +  Mi  -  20)^  +  Mi  -  25)^ 

It  follows  that  the  overall  mixed  linear  model  has  the  following  form: 

Vij  =  +  Mij  +  Mlj  +  Mlj  +  ^4{iij  ~  5)+  +  ^^{iij  —  10)  + 

-h  Miij  -  15)+  +  MUj  -  20)^  +  Miii  -  25)5-  +  e.-,-  (1). 

This  model  assumes  that  the  shape  of  the  function  describing  the  change  in  estradiol  over 
the  menstrual  cycle  is  the  same  for  all  subjects,  but  that  subjects  can  differ  with  regard  to 
their  baseline  level  on  day  0. 

Several  techniques  can  be  utilized  to  obtain  estimates  for  the  variance  components  and 
regression  parameters  in  a  mixed  linear  model,  including  traditional  analysis  of  variance 
(ANOVA)  methods,  maximum  likelihood  (ML)  methods,  and  restricted  maximum  likelihood 
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(REML)  methods.  REML  estimates  of  the  variance  components  are  generally  preferred,  since 
ML  estimates  do  not  take  into  account  the  degrees  of  freedom  used  to  estimate  the  fixed 
effects,  which  can  result  in  estimates  of  variance  components  which  are  downwardly  biased 
(Laird  and  Ware,  1982).  The  ANOVA  methodology  also  has  limitations,  especially  with 
unbalanced  data,  such  as  negative  variance  estimates,  and  lack  of  distributional  properties 
(Searle  et  al).  For  these  reasons,  we  used  the  REML  method  from  the  SAS  PROC  MIXED 
procedure  to  fit  model  (1). 

Once  the  parameters  in  model  (1)  are  estimated,  estradiol  levels  adjusted  for  day  of  cycle 
can  be  computed  using  several  approaches.  One  approach  is  to  calculate  the  deviation  of 
the  subject’s  observed  value  from  the  expected  value  for  that  day  of  the  cycle  based  on  the 
fitted  curve; 

Xij  =  Vij  -  HUj)-  (2) 

Similarly,  when  repeated  hormone  measurements  are  available  on  all  subjects  and  the  average 
hormonal  level  is  used  as  the  exposure,  the  average  adjusted  for  day  of  cycle  can  be  calculated 
as: 

Vij  -  S{tij)}/n.  (3) 

j 

An  alternative  approach  is  to  use  the  estimate  of  the  random  subject  effect,  A,-,  from  (1) 
for  each  i  =  The  best  linear  unbiased  predictor  (BLUP)  of  a,  is 

the  expected  value  of  aj,  conditional  on  and  d'g,  which  is  also  the  empirical  Bayes 

estimator  of  q,-.  It  can  be  shown  that 

E{ai\yi,  (4) 

where 


Note  that  can  also  be  interpreted  as  the  reliability  coefficient  of  y,. 
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Estradiol  levels  adjusted  for  day  of  cycle  can  be  computed  using  one  of  the  above  ap¬ 
proaches  for  all  subjects  and  then  used  as  the  exposure  in  the  usual  logistic  (or  conditional 
logistic  for  matched  studies)  regression  model  to  evaluate  the  association  between  estradiol 
level  and  risk  of  breast  cancer  among  pre-menopausal  women.  The  estimates  of  the  standard 
errors  for  the  regression  parameter  will  be  underestimated,  however,  since  the  uncertainty 
associated  with  the  estimates  for  the  calibration  curve  are  not  taken  into  account.  In  this  pa¬ 
per,  we  utilize  a  bootstrap  algorithm  for  obtaining  estimates  of  the  confidence  intervals  which 
include  the  variability  contributed  by  estimation  of  the  calibration  curve.  The  algorithm  is 
described  in  the  next  section. 

3  Results 

The  primary  aim  of  the  NYU  Women’s  Health  Study  is  to  determine  whether  endogenous 
hormones  such  as  estradiol,  are  associated  with  risk  of  breast  cancer.  Between  March  1985 
and  June  1991,  a  cohort  of  healthy  women  aged  34-65  years  were  enrolled  at  the  Guttman 
Breast  Diagnostic  Institute,  New  York.  At  the  time  of  enrollment  and  at  annual  screening 
visits  thereafter,  women  were  asked  to  donate  blood  and  complete  a  self-administered  ques¬ 
tionnaire.  Serum  samples  were  frozen  and  stored  for  future  biological  assays.  Subsequent 
cases  of  breast  cancer  were  identified  primarily  through  active  follow-up  and  confirmed  by 
reviewing  medical  and  pathological  records. 

In  order  to  limit  the  costs  associated  with  measuring  hormone  levels  in  the  cohort,  a 
nested  case-control  study  design  was  used.  For  each  incident  case  of  breast  cancer,  indi¬ 
vidually  matched  controls  were  selected  at  random  from  the  risk  set  consisting  of  all  cohort 
members  alive  and  free  of  breast  cancer  at  the  time  of  diagnosis  of  the  case,  and  who  matched 
the  case  on  menopausal  status  at  entry,  age  at  entry,  and  number  and  approximate  dates  of 
blood  donations  up  to  the  date  of  diagnosis  in  the  case.  For  additional  details  of  the  study 
design,  see  Toniolo  et  al  (1991).  The  association  between  endogenous  estrogens  and  breast 
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cancer  in  post-menopausal  women  was  reported  in  Toniolo  et  al  (1995).  In  this  paper,  we 
focus  on  the  association  between  total  estradiol  and  risk  of  breast  cancer  in  pre-menopausal 
women. 

A  total  of  498  estradiol  measurements  from  367  pre-menopausal  control  subjects  were 
utilized  to  fit  the  calibration  curve:  278  subjects  had  1  measurement,  60  had  2,  28  had  3, 
and  4  had  4  measurements.  Only  measurements  obtained  less  than  35  days  prior  to  next 
menses  were  included.  Total  estradiol  levels  were  log  transformed  prior  to  model  fitting  to 
adjust  for  deviations  from  normality. 

The  estimated  mean  curve  describing  the  change  in  log  estradiol  level  over  the  men¬ 
strual  cycle  is  shown  in  Figure  1.  The  parameters  in  model  (1)  were  estimated  as  follows: 

fi  =  4.16,  A  =  •36,yS2  =  -.029,^03  =  -.00067,^4  =  .00061, /05  =  -•012,/36  =  .012,(07  = 
—.0077,(08  =  .0052.  These  estimates  were  then  used  to  calculate  levels  of  total  estradiol 
adjusted  for  phase  of  menstrual  cycle  using  the  three  approaches  described  above. 

The  results  from  fitting  conditional  logistic  regression  models  to  the  adjusted  total  estra¬ 
diol  levels  are  shown  in  Table  1.  The  estimate  of  the  logistic  regression  coefficient  based  on 
the  adjusted  first  and  average  measurements  calculated  from  (2)  and  (3),  respectively,  are 
similar  to  the  estimate  using  the  unadjusted  first  measurement  because  in  the  original  study 
design,  cases  were  matched  to  controls  according  to  the  phase  of  menstrual  cycle  of  the  first 
measurement. 

In  contrast,  when  the  emprical  Bayes  estimator  from  (4)  was  utihzed  as  the  exposure, 
the  regression  coefficient  estimate  increased  substantially.  This  increase  is  not  surprising, 
given  that  the  estimator  in  (4)  can  be  viewed  as  an  estimator  of  (3)  that  has  been  corrected 
for  measurement  error.  Whittemore  (1989),  and  Armstrong,  Whittemore  and  Howe  (1989), 
have  proposed  analogous  forms  of  (4)  as  a  method  for  correcting  for  measurement  error  in 
linear  and  logistic  regression  models.  The  method,  commonly  referred  to  as  “Stein  shrink¬ 
age”  ,  involves  multiplying  an  exposure  variable  measured  with  error  by  the  reliability  of  the 
exposure  prior  to  fitting  the  regression  model  to  obtain  corrected  coefficient  estimates.  In 
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the  absence  of  confounders,  measurement  error  in  the  exposure  variable  will  result  in  relative 
risk  estimates  that  are  attenuated  compared  to  the  true  relative  risk.  It  follows  that  meth¬ 
ods  which  correct  for  measurement  error  should  yield  higher  estimates  of  relative  risk  than 
the  uncorrected  estimates,  and  that  the  regression  coefficent  based  on  the  empirical  Bayes 
estimator  should  be  higher  than  the  uncorrected  average.  It  should  be  noted,  however,  that 
this  estimator  does  not  completely  correct  for  measurement  error  because  the  reliability  co¬ 
efficient  in  (5)  should  be  adjusted  for  the  matching  strata  (ref.  Kim  et  al,  1995).  That  is,  the 
between-subject  variance  component  may  be  overestimated  if  it  is  not  computed  controlling 
for  the  variation  due  to  matching. 

The  width  of  the  confidence  intervals  in  Table  (1)  are  underestimated  because  the  extra 
variability  due  to  estimation  of  the  calibration  curve  is  not  taken  into  account.  To  ob¬ 
tain  standard  errors  which  incorporate  this  additional  source  of  variation,  we  propose  the 
following  bootstrap  procedure: 

1.  Generate  a  bootstrap  sample  from  the  control  subjects. 

2.  Fit  model  (1)  to  the  bootstrap  sample  to  estimate  the  parmeters  of  the  calibration 
curve 

3.  Generate  a  bootstrap  sample  from  the  matched  cases  and  controls,  using  the  matching 
stratum  as  the  sampling  unit. 

4.  Adjust  the  total  estradiol  measurements  for  day  of  menstrual  cycle  using  the  estimates 
from  step  (2). 

5.  Fit  conditional  logisitc  regression  models  to  the  adjusted  total  estradiol  measurements. 

6.  Repeat  (l)-(4)  1,000  times,  which  is  the  approximate  minimum  number  of  bootstraps 
necessary  to  compute  biaa-corrected  confidence  limits. 
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The  simple  (1  —  a)%  confidence  interval  can  be  constructed  using  the  a/2  and  (1  —  a/2) 
percentiles  of  the  bootstrap  distribution.  Bias-  corrected  confidence  intervals  should  be  used 
when  the  bootstrap  distribution  of  the  regression  parameter  is  asymmetric  and  when  the 
sample  size  is  small. 

4  Conclusions 

The  computer  program  for  generating  the  bootstrap  sample  described  above  has  been  de¬ 
veloped  using  the  SAS  macro  facility,  but  still  needs  to  be  tested.  In  addition,  progress  on 
this  project  has  been  slower  than  anticipated  because  of  problems  with  the  coding  of  the 
day-of-cycle  data  which  needed  to  be  resolved.  For  example.  Figure  1  shows  that  an  unusu¬ 
ally  large  number  of  subjects  gave  a  blood  sample  20  days  prior  to  the  next  menstrual  cycle. 
This  is  an  artifact  of  the  coding  rule  that  cohort  members  with  irregular  or  unknown  cycle 
lengths  and  whose  blood  donation  occurred  between  day  1  and  day  8  after  the  first  day  of 
last  menustrual  cycle  were  considered  to  have  been  in  the  early  follicular  phase  and  at  20 
days  prior  to  next  menses.  Because  this  coding  mechanism  could  adversely  impact  the  fit  of 
the  model  in  the  area  of  the  curve  to  the  right  of  20  days  prior  to  next  menses  in  Figure  1, 
model  (1)  needs  to  be  refitted  to  the  data  after  identifying  and  excluding  the  measurements 
which  were  artificially  assigned  to  the  time  of  20  days  prior  to  next  menses.  We  anticipate 
that  this  will  be  accomplished  by  the  end  of  this  final  year  of  the  funding  period,  along  with 
implementation  of  the  bootstrap  method  and  preparation  of  a  manuscript  describing  the 
methods  and  results. 
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A  mixed 


Table  1:  Logistic  Regression  Parameter  Estimates  and  95%  Confidence  Intervals  for 
the  Associations  of  Total  Estradiol  Level  and  Risk  of  Breast  Cancer  in  Pre- menopausal 
Women 


Exposure  Variable 

Regression  Coefficient  95%C.L 

Total  Estradiol 

First  measurement 

Adjusted  first  measurement 
Adjusted  average 

Empirical  Bayes  adjusted  average 

0.19  (-0.23  -  0.61) 

0.26  (-0.17  -  0.70) 

0.17  (-0.28  -  0.63) 

1.52  (-0.83  -  3.87) 

Chapter  II 


The  Analysis  of  Correlated  Panel  Data  Using 
a  Continuous  Time  Markov  Model 


1  Introduction 

Many  biological  processes  may  be  described  in  terms  of  a  finite  number  of  states  which 
individuals  proceed  through  over  time.  For  example,  Klein  et  al  (1984)  and  Kay  (1986) 
have  modeled  the  phases  of  cancer  as  a  three  stage  disease  process. 

In  most  longitudinal  studies  which  follow  the  passage  of  individuals  through  various 
biological  states,  the  available  data  consist  of  the  series  of  states  each  subject  was 
observed  to  be  in  at  various  points  in  time.  Because  monitoring  is  not  continuous, 
information  about  the  process  is  usually  unavailable  between  follow-up  times,  and  the 
exact  times  of  transition  from  one  state  to  another  are  not  known.  This  type  of  data, 
in  which  the  observations  consist  of  the  states  occupied  by  the  individuals  under  study 
at  a  sequence  of  discrete  time  points,  is  often  referred  to  as  “panel”  data.  Methods  for 
analyzing  panel  data  using  time- homogeneous  Markov  models  have  been  explored  by 
Klein  et  al.  (1984),  Kalbfleisch  and  Lawless  (1985),  and  Gentleman  et  al  (1994). 

Existing  methods  for  the  analysis  of  panel  data  have  been  applicable  to  the  case 
where  a  single  biological  process  is  of  interest.  However,  many  clinical  studies  monitor 
two  or  more  processes  over  time  on  each  individual,  where  the  processes  observed  within 
a  subject  may  be  correlated.  For  example,  in  certain  longitudinal  studies  involving 
HIV-infected  subjects,  several  immunologic  variables,  such  as  CD4,  CD8  and  serum 
immunoglobulins  are  each  measured  periodically  on  all  subjects.  Also,  in  most  clinical 
trials,  several  possibly  correlated  clinical  variables  are  repeatedly  assessed  during  the 
follow-up  period  of  each  patient. 

In  the  clinical  trial  which  motivated  the  methodology  proposed  in  this  paper,  pa¬ 
tients  with  the  Acquired  Immune  Deficiency  Syndrome  (AIDS)  were  randomized  to 
receive  one  of  two  drugs:  ganciclovir  or  foscarnet,  for  the  treatment  of  cytomegalovirus 
(CMV)  retinitis,  and  were  periodically  monitored  for  the  development  of  toxicities  in 
several  hematologic,  enzymatic,  and  chemical  variables.  For  each  variable,  five  states 
of  toxicity  were  defined:  none,  mild,  moderate,  severe  and  life-threatening,  according 
to  the  AIDS  Clinical  Trials  Group  (ACTG)  toxicity  grading  system.  The  data  for  a 
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particular,  patient  consisted  of  repeated  assessments  over  time  of  the  toxicity  states  of 
each  of  several  possibly  correlated  clinical  variables,  yielding  what  we  refer  to  as  “cor¬ 
related  panel”  data.  To  our  knowledge,  statistical  methods  for  analyzing  correlated 
panel  data  have  not  been  previously  developed. 

Wei,  Lin  and  Weissfeld  (1989)  described  a  method  for  analyzing  multivariate  sur¬ 
vival  data  which  models  each  survival  distribution  marginally  using  a  Cox  proportional 
hcizards  model,  and  makes  no  specific  assumptions  regarding  the  structure  of  depen¬ 
dence  between  distinct  failure  times  on  each  subject.  This  approach  has  also  been 
utilized  by  Stram,  Wei  and  Ware  (1988).  In  this  paper,  we  apply  the  marginal  ap¬ 
proach  to  the  analysis  of  correlated  panel  data.  We  model  each  process  based  on  a 
time-homogeneous  Markov  model  allowing  for  covariates  and  impose  no  specific  de¬ 
pendence  structure  among  the  related  processes.  The  resulting  estimators  are  shown 
to  be  asymptotically  jointly  normal  with  a  covariance  matrix  that  accounts  for  the 
dependence  among  related  processes  and  can  be  consistently  estimated.  Simultaneous 
inference  procedures  are  also  proposed.  In  Section  2,  the  method  for  analyzing  corre¬ 
lated  panel  is  described,  and  in  Section  3  an  example  is  presented  using  data  from  the 
CMV  clinical  trial. 


2  Methods 

2.1  Markov  Modeling  and  Estimation 

Assume  that  K  processes  are  periodically  observed  on  each  subject,  and  that  each 
process  has  Q  ordered  states.  We  model  the  A:**  process,  {k  =  as  a  time- 

homogeneous  Markov  process,  where  the  first  Q  —  1  are  transient  states,  and  the  Qth  is 
a  single  absorbing  state.  Transitions  are  assumed  to  occur  irreversibly  from  one  state  to 
the  next.  The  time- homogeneous  Markov  assumption  implies  that  times  to  transition 
between  states  are  exponentially  distributed  with  a  hazard,  or  transition  intensity,  that 
may  be  modeled  as  a  function  of  covariates. 
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For  notational  convenience,  we  assume  a  single  covariate,  z,  is  of  interest.  Then, 
following  the  proportional  hazards  model  of  Cox  (1972),  the  transition  intensities  for 
each  process  can  be  modeled  as, 

^q(^)  =  q  =  l, ...,  Q-l,k  =  l, ...,  K 

where  A*  is  the  baseline  transition  intensity  between  states  q  and  9  +  1  for  the  A:th 
process,  and  is  the  regression  parameter  denoting  the  effect  of  on  A*. 

For  the  subject  {i  =  1, ...,  n)  with  covariate  z,-,  the  probability  that  the  process 
will  be  in  state  s  at  time  ti+t,  given  that  the  process  was  in  state  r  at  time  ti,  is  denoted 
as  Prsii^  ^i)‘  Then,  under  the  assumptions  that  A|(^,)  ^  Af(2,),  I  ^  j,j,  I  —  1,  — 

standard  methods  may  be  utilized  to  show  that  this  transition  probability  is  equal  to 

n‘=.,w{AK^i)  - 

where  r  =  —  1;5  >  r,  and  s  <  Q  (Chiang,  1980).  The  transition  probabilities 

from  a  transient  state  r  to  the  absorbing  state  Q  are  equal  to 

Q-i 

PrQii^  ^i)  =  1-11  Prji^^  ^  =  1’  -<5  -  1) 

i=r 

and  p'^Q{t,Zi)  =  1. 

Now,  let  Sij  denote  the  state  of  the  fcth  process  for  the  ith  subject  at  the  jth 
observation  {j  =  1, , .,  m,),  at  time  measured  in  days  since  date  of  initial  observation. 
Then  the  observed  history  of  the  process  for  the  subject  may  be  denoted  as 


Pts{t,^i)  =  (-1)* 


j=r 


(1) 


Ak  —  ( „k  ±k  k  fk  \ 

—  V*«l» •••)  ■’imp ''tTOi/' 


Following  the  Markov  property  and  conditional  probability  arguments,  the  probability 
of  observing  Af ,  conditional  on  can  be  expressed  as, 

Pr(Af|z,)  =  Pr(Z:‘^  process  in  state  at  time  tfj, ..., state  at  time  tf„jjzi) 

m*— 1 
j=i 
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where  the  form  of  —  tij,  ^i)  -may  be  derived  from  (1).  Then  the  marginal 

likelihood  function  for  the  process  over  all  n  subjects  is 

m)  =  f[Pr{Al\zi) 

t=l 

n  (mi-l  I 

=  n  ]  II  ~  [  5 

>=i  I  j=i  ) 

where  9k  =  The  corresponding  log-likelihood  is  equal  to 


m) = E 

t=i 


m*  —  1 

i=l 


(2) 


A 

Substituting  (1)  in  (2),  the  maximum  likelihood  estimator,  6k^  of  0k ^  for  each  process 
may  be  obtained  by  solving  dC{9k)l dBk  =  0,  for  k  =  It  follows  that  under 

the  usual  regularity  conditions,  y/n{9k  —  9k)  for  each  process  is  asymptotically  normal 
with  mean  0  and  covariance  matrix  Ik^{9k),  where  lk{0k)  is  defined  as 

[  ld^C{9k)'\  1*1  (aa9u))  (dC(9u))^ 

h{^k)  =  Ee^. 


n  891 


l\dC{9k)\\dC{9k)y 


n 


(3) 


The  estimates  of  9k  from  different  processes  may  be  correlated.  To  take  into  account 
the  correlations  among  (9jt’s,  consider  the  Taylor  series  expansion 

Uk 


where 


and 


V^{9k  -  9k)  ~  Ik 


1  TT  _  1  ^  dCi{9k)  _  1  ^  y? 

d9k 

Ik{0k)-  ^  • 


(4) 


By  the  above  development,  y/n  < 


— 

yh  ) 

j 

can  be  written  as  the  sum  of 


independent  vectors  with  mean  0,  and  thus,  under  the  usual  regularity  conditions. 
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follow  a  multivariate  normal  distribution  .with  mean  0  and  covariance  matrix  S,  which 
may  be  partitioned  as 


£ii 

Si2 

•  •  •  ^IK 

Si2 

S22 

•  •  •  ^2K 

SiA' 

^2K 

■  •  •  ^KK 

where  and  S^jt  is  the  covariance  matrix  between  \/n{0j  —  6j)  and 

\/n{9k  —  0k)-  From  (4),  the  covariance  matrix  of  {Ukly/n,  Ujfy/n)  can  be  estimated 
consistently  by 


i=l 


(5) 


where  is  equal  to  dCi{9k)/d9f:  evaluated  at  Ci{9k)  is  the  contribution  to  the 

log-likelihood  made  by  the  subject.  The  term  ^ij  is  defined  similarly.  Since  {0k) 

and  lk^{0k)  =  Z)"=i ^  are  equivalent  in  probability,  it  follows  that  one  can 


estimate  by 

kHh), 

for  j  ^  k  and  j,  k  =  1, ...,  K. 


2.2  Hypothesis  Testing 

The  multivariate  normal  distribution  of  provides  a  basis  for  making  si- 

multaneous  inferences  about  the  0kS.  In  particular,  suppose  that  we  are  interested 
in  evaluating  the  effect  of  z,-  on  the  transition  intensities  for  the  K  processes.  Let 
V  =  {Vu-,Vkx(Q-i)}  =  {^n,-,0Q-i,i,-,l3iK,-jQ-i,K)  denote  the  parameters  of 
interest,  and  let  $  denote  the  estimate  of  the  covariance  matrix  of  rj,  which  can  be 
obtained  from  S.  Then  asymptotically  the  following  holds 

w  =  ~  x|kx(8-.))-  (6) 

The  above  W  statistic  can  then  be  used  to  simultaneously  test  the  null  hypotheses  Hf. 
7//  =  0,  for  /  =  1,  X  (Q  —  1).  If  the  null  hypotheses  are  rejected,  the  next  step  is 
to  decide  which  of  the  rji  are  non-zero. 
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Let  {^1,^2)  •••5^K’x(Q-i)}  denote  the  standardized  estimator  of  ■■■■>Vkx{q-i)}', 

where  fji  =  and  ^//  is  the  element  of  Under  the  null  hypotheses 

Hi,  {iji,  ...,ijKx(Q-i)}  is  approximately  normal  with  mean  0  and  covariance  matrix 

/wAAA  .  A  A 

$  =  {4'pq}i  where  <j)pq  =  ^pq! {<i>pp<i>qqy'^ ■,  and  <f>pq  is  the  (p,  q'jth  element  in  $.  A 
conventional  multiple  testing  procedure  rejects  Hi,  I  =  x  Q  —  I,  if  \fji\  >  c, 

where  c  is  the  smallest  number  such  that 

Pr{\rji\  <cj  =  X  Q\Hi,...,HK)iQ-i)  <  1  -  a, 

and  a  is  a  prespecified  level  of  signifcance.  However,  the  sequential  multiple  test 
procedures  studied  by  Marcus,  Peritz  and  Gabriel  (1976),  Holm  (1979)  and  Wei  and 
Stram  (1988)  will  yield  more  powerful  tests  than  the  above  conventional  multiple  testing 
procedures. 

Following  Wei  and  Stram  (1988),  let  p*  be  the  /th  largest  absolute  value  of  the 
rji's,  and  let  be  the  corresponding  variance-covariance  matrix  that  is  obtained  by 
rearranging  the  rows  and  columns  of  Also,  let  H^  :  p*  =  0  be  the  ordered  hypotheses 
from  the  ff/’s  according  to  the  order  of  pj, ...,  Furthermore,  let  (Zi, ...,  ZkxQ-i) 

denote  a  multivariate  normal  vector  with  mean  zero  and  covariance  matrix  $*.  Starting 
with  the  hypothesis  H^,  we  reject  77*,/  =  X  Q  —  1,  ii  P{mmi<j<KxQ-i  ^ 

^  o;/2,  where  a  is  a  prespecified  two-sided  level  of  significance,  provided  that 
Hi,  ...,Hi_i  have  been  tested  and  rejected.  It  can  be  shown  that  the  eisymptotic  type  I 
error  probability  of  this  procedure  is  a  for  any  combination  of  true  77/s.  An  illustration 
of  the  sequential  multiple  testing  procedure  is  provided  in  the  example. 

3  Example 

The  methods  developed  in  the  previous  section  were  applied  to  data  derived  from 
the  CMV  retinitis  clinical  trial.  This  trial,  conducted  by  the  Studies  of  Ocular  Com¬ 
plications  of  AIDS  Research  Group,  weis  designed  to  compare  ganciclovir  (Cytovene, 
Syntex  Laboratories)  and  foscarnet  (Foscavir,  Astra  Pharmaceutical  Products)  in  the 
treatment  of  CMV  retinitis  in  AIDS  patients  (SOCA,  1992). 
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Two  hundred  and  thirty-four  patients  were  randomly  assigned  to  treatment:  127 
to  ganciclovir  and  107  to  foscarnet.  One  of  the  objectives  of  this  trial  was  to  compare 
the  toxicity  profiles  of  the  two  drugs.  We  focus  particularly  on  the  toxic  effects  of 
the  treatments  on  the  two  hematologic  variables:  hemoglobin  and  absolute  neutrophil 
count,  which  could  be  correlated  within  individuals.  Five  stages  of  toxicity  were  defined 
for  each  outcome  based  on  the  following  ACTG  toxicity  criteria.  For  HGB,  the  stages 
were  defined  as,  none:  >  W.^gfdl,  mild:  9.5  —  10.9p/d/,  moderate:  8.0  —  9Agfdl, 
severe:  6.5  —  7.9gfdl  and  life- threatening: <  6.5gldl.  The  corresponding  toxicity  grades 
for  ANC  were,  none:  >  15001  fil,  mild:  1000-1499////,  moderate:  750-999////,  severe: 
500— 74:9 f  fill  and  life-threatening:  <  500////.  It  should  be  emphasized  that  even  though 
the  quantitative  levels  of  each  variable  were  known,  the  grade  of  toxicity  rather  than 
the  actual  level  was  more  clinically  relevant.  In  addition,  because  very  few  subjects 
were  observed  to  progress  to  the  life-threatening  stage  of  both  HGB  and  ANC  toxicity, 
this  state  was  combined  with  the  severe  toxicity  state  to  yield  four  possible  states  for 
each  process.  The  goals  of  the  analysis  were  to  estimate  and  compare  the  effects  of 
foscarnet  and  ganciclovir  on  the  transition  intensities  between  the  states  of  toxicity  of 
HGB  and  ANC,  and  to  estimate  the  average  waiting  time  in  each  state. 

The  methods  in  the  previous  section  were  based  on  the  assumption  that  transitions 
between  states  occur  irreversibly.  Levels  of  HGB  and  ANC  may  not  decrease  mono- 
tonically  over  time,  but  could  increase  from  one  visit  to  the  next.  When  this  occurred 
in  our  data  set,  the  process  was  simply  held  in  the  state  defined  by  the  previous  ob¬ 
servation  until  it  was  observed  to  decline  again,  thus  allowing  us  to  model  the  states 
as  irreversible.  The  frequencies  of  occurrences  in  which  the  processes  were  observed 
to  increase  rather  than  decrease  from  one  visit  to  the  next  were  relatively  low  and 
were  similar  in  the  two  groups  (ANC:  20%  vs.  18%  for  foscarnet  and  ganciclovir,  re¬ 
spectively;  HGB:  21%  vs.  18%  for  foscarnet  and  ganciclovir,  respectively),  so  that  the 
assumption  of  the  progressive  model  would  not  likely  bias  the  estimates  of  treatment 
effects. 

The  covariate  denoting  treatment  group  for  the  subject  is  defined  as  2;i  =  0 
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for  Foscarnet,  and  Zi  =  1  for  Ganciclovir.  The  parameter  vector  for  HGB  is  de- 
noted  as  6^  =  '^3  5^15^2  5/^3)  >  corresponding  vector  for  ANC  is  62  = 

where  is  the  coefficient  corresponding  to  the  effect  of  treat¬ 
ment  on  the  transition  intensity  between  states  q  and  5  -f- 1  for  process  k. 

Tables  1  and  2  show  the  estimates  of  the  transition  intensities  and  the  ^  coefficients 
for  the  two  processes.  The  mean  waiting  time  for  the  state  was  estimated  by 
l/(Ae^^).  The  mean  waiting  times  in  each  state  of  ANC  for  ganciclovir  are  less  than 
half  the  mean  waiting  times  for  foscarnet,  indicating  the  toxic  effects  on  ANC  are 
more  severe  with  ganciclovir.  On  the  other  hand,  for  HGB,  foscarnet  appears  to  be 
associated  with  a  higher  probability  of  transition  from  mild  to  moderate,  and  moderate 
to  severe  toxicity. 

Figures  1  and  2  show  the  estimated  cumulative  distribution  functions  of  time  to 
severe  toxicity  of  HGB  and  ANC,  respectively,  for  each  treatment.  The  difference  be¬ 
tween  treatments  in  the  probability  of  reaching  severe  toxicity  is  much  greater  for  ANC 
than  HGB.  The  Kaplan-Meier  estimates  of  the  distribution  functions  are  also  shown  for 
comparison.  Our  model  appears  to  underestimate  the  probabilities  of  progressing  to 
severe  toxicity  relative  to  the  Kaplan-Meier  estimates.  The  lack-of-fit  of  the  model  was 
also  apparent  in  results  (not  shown)  of  the  goodness-of-fit  test  proposed  by  Gentleman 
(1994)  that  was  performed.  The  inadequacy  of  the  assumed  model  could  indicate  that 
the  estimates  of  the  regression  coefficients  and  corresponding  covariance  matrix  may 
not  be  valid;  however,  for  illustrative  purposes  we  proceed  to  present  the  results  of  the 
sequential  multiple  testing  procedure. 

The  covariance  matrix  for  01, ^li ^2^ ^i)  estimated  to  be 

^  0.0560  0.0020  0.0023  0.0006  0.0116  -0.0009  ^ 

0.0689  -0.0006  0.0087  0.0072  -0.0004 

0.1072  0.0088  -0.0023  0.0082 

0.0861  0.0024  -0.0014 


V 
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0.0692  0.0082 

0.0968  J 


To  jointly  test  =  0,  /  =  1,2, 3;  A;  =  1,2,  we  calculated  the  test  statistic,  W, 

from  (4) ,  which  approximately  follows  a  distribution  with  6  degrees  of  freedom.  The 
observed  W  was  42.96  which  is  significant  at  less  than  1%  level. 

In  order  to  evaluate  which  of  the  coefficients  were  non-zero,  we  employed  the  mul¬ 
tiple  testing  procedure  described  in  the  previous  section.  The  standardized  parameter 
estimates  were  3.477,  3.910,  2.629,  -1.721,  -1.039,  and  0.426  re¬ 
spectively.  Since  -\^l\  <  -\^\\  <  -\Pl\  <  -\^l\  <  we  tested  the  null 

hypotheses  in  the  order:  Using  the  integration  algorithm  for 

multivariate  normal  probabilities  in  Schervish(1984,  1985),  we  have 

P{imn{Zr,Z2,Zs,Z4,Zs,Ze}  <  -3.910)  »  0.0002 
P(min{Z2,Z3,.^4,-^5,^6}  <  -3.477)  w  0.0012 
P{mm{Z3,  Z4,  Z5,  Ze}  <  -2.629)  Ri  0.0279 
P(min{Z4,  .^5,-^6}  <  -1.721)  «  0.1230. 

Assuming  a  two-sided  a  =  0.05,  these  results  suggest  that  ganciclovir  has  a  greater 
toxic  effect  on  ANC  than  foscarnet,  across  all  states  of  toxicity.  However,  the  evidence 
is  insufficient  to  conclude  that  the  toxicity  profiles  for  HGB  are  different  for  the  two 
treatments. 


4  Conclusions 

Although  the  proposed  methods  were  applied  to  data  from  a  clinical  trial  of  two  treat¬ 
ments  for  CMV  retinitis,  they  are  also  applicable  to  longitudinal  studies  of  other  dis¬ 
eases,  such  as  breast  cancer,  in  which  several  clinical  variables  are  periodically  moni¬ 
tored  over  time  on  each  subject. 

A  key  assumption  of  the  methods  is  that  transition  rates  between  states  do  not 
change  over  time.  This  time-homogeneous  Markov  assumption  is  quite  restrictive, 
however,  and  may  in  part  explain  the  apparent  lack  of  fit  of  the  model.  An  alternative 
approach  is  to  assume  that  the  intensities  are  piecewise  constant  over  distinct  time 
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intervals  (Gentleman  et  al.,  1994).  Kalbfleisch  and  Lawless  (1985)  also  suggested  fitting 
a  parametric  time-dependent  model. 

In  addition,  we  have  assumed  a  progressive  Markov  model,  in  which  transitions 
between  states  can  only  occur  in  one  direction.  The  proposed  methodology  can  in 
principle  be  extended  to  the  situation  where  arbitrary  transitions  are  allowed,  by  mod¬ 
elling  each  process  marginally  using  the  methods  in  Kalbfleisch  and  Lawless  (1985) 
and  deriving  the  joint  covariance  matrix  estimate  of  all  model  parameters  empirically. 
However,  because  a  larger  number  of  parameters  are  required  to  specify  the  reversible 
model  compared  with  the  progressive  model,  estimation  of  the  parameters  may  be  more 
difficult. 
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Table-l.  Estimated  parameters  and  mean  waiting  times  in  eaeh  state  of  toxicity 
_ for  Absolute  neutrophil  count 


Stage 

(A±S.D.,/9±S.D.) 

Mean  waiting  time 
for  Fos.(days) 

Mean  waiting  time 
for  Gov.  (days) 

1 

(.0070±.0013,  .823±.237) 

(143  ±  27) 

(63  ±  8) 

2 

(.0059±.0014,  L03±.263) 

(169  ±  40) 

(61  ±  8) 

3 

(.0065±.0020,  .861±.327) 

(154  ±  47) 

(65  ±  8) 

TabIe-2. 

Estimated  parameters  and 
for  Hemoglobin  count 

mean  waiting  times  in 

each  state  of  toxicity 

Stage 

(A±S.D.,i3±S.D.) 

Mean  waiting  time 
for  Fos.(days) 

Mean  waiting  time 
for  Gov.(days) 

1 

(.0139±.0027,  -.505±.293) 

(72  ±  14) 

(119  ±  26) 

2 

(.0098±.0018,  -.273±.263) 

(102  ±  19) 

(135  ±  25) 

3  (.0062d:.0013, 0.133d:.311)  (161  ±  34) 


(135  ±  25) 
(142  ±  32) 


Probability  Probability 
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Figure  1.  C.D.F.  and  Kaplan-Meier  Estimates  of 
Time  to  Severe  Toxicity  of  ANC 
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Figure  2.  C.D.F.  and  Kaplan-Meier  Estimates  of . 
Time  to  Severe  Toxicity  of  HGB 
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The  authors  present  a  technique  for  correcting  for  exposure  measurement  error  in  the  analysis  of  case- 
control  data  when  subjects  have  a  variable  number  of  repeated  measurements,  and  the  average  is  used  as  the 
subject’s  measure  of  exposure.  The  true  exposure  as  well  as  the  measurement  error  are  assumed  to  be 
normally  distributed.  The  method  transforms  each  subject’s  observed  average  by  a  factor  which  is  a  function 
of  the  measurement  error  parameters,  prior  to  fitting  the  logistic  regression  model.  The  resulting  logistic 
regression  coefficient  estimate  based  on  the  transformed  average  is  corrected  for  error.  A  bootstrap  method 
for  obtaining  confidence  intervals  for  the  true  regression  coefficient,  which  takes  into  account  the  variability 
due  to  estimation  of  the  measurement  error  parameters,  is  also  described.  The  method  is  applied  to  data  from 
a  nested  case-control  study  of  hormones  and  breast  cancer.  Am  J  Epidemiol  1997:145:1003-10. 

epidemiologic  methods;  breast  neoplasms;  hormones:  measurement  error;  statistics 


In  most  case-control  studies,  the  risk  factors  of 
interest  are  measured  with  error.  For  biologic  vari¬ 
ables,  such  as  blood  pressure,  nutrients,  and  hormone 
levels,  measurement  error  can  arise  from  limitations  in 
the  measurement  technique  or  laboratory  assay.  In 
addition,  because  the  exposure  of  interest  is  usually  a 
subject’s  underlying  long-term  average  value  rather 
than  the  level  at  any  single  point  in  time,  intrinsic 
fluctuations  in  the  variable  over  time  can  also  contrib¬ 
ute  to  measurement  error. 

When  the  error  is  random  and  nondifferential  with 
respect  to  case-control  status,  it  is  well  known  that 
estimates  of  relative  risk  based  on  the  mis-measured 
exposure  will  be  attenuated.  In  order  to  minimize  the 
effects  of  measurement  error,  many  investigators  ad¬ 
vocate  collecting  repeated  measurements  of  the  expo¬ 
sure  on  all  subjects  and  using  the  individual’s  average 
value  (1).  However,  as  noted  by  Rosner  et  al.  (2),  even 
when  the  mean  of  several  replicates  is  substituted  for 
a  single  measurement,  attenuation  of  relative  risks 
may  still  occur,  especially  when  the  degree  of  mea¬ 
surement  error  is  large  and  the  average  is  based  on 
only  a  few  repeats. 

Methods  for  correcting  estimates  of  relative  risk  for 
measurement  error  have  been  proposed  in  a  number  of 
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epidemiologic  and  statistical  papers  (3,  4).  The  most 
common  method  involves  correcting  the  “naive”  rel¬ 
ative  risk  estimate  based  on  the  observed  exposure  by 
the  expected  amount  of  bias.  In  the  case  of  logistic 
regression,  the  regression  parameter  will  be  attenuated 
by  the  factor,  R,  which  is  equal  to  the  reliability 
coefficient  of  the  mis-measured  exposure  (1,  2). 
Therefore,  one  can  multiply  the  biased  estimate  of  the 
regression  coefficient  by  the  inverse  of  the  reliability 
coefficient  to  obtain  the  corrected  estimate.  This 
method,  however,  is  dependent  on  the  assumption  that 
the  reliability  of  the  exposure  measurement  is  the  same 
for  all  subjects.  When  the  average  of  several  replicates 
is  used  as  the  measure  of  exposure,  this  condition  will 
be  met  only  if  all  subjects  have  an  equal  number  of 
repeated  measurements,  given  that  the  degree  of  mea¬ 
surement  error  associated  with  a  single  measurement 
is  the  same  for  all  subjects. 

In  studies  in  which  the  exposure  is  measured  on 
repeated  occasions,  however,  subjects  often  have  a 
variable  number  of  measurements  because  of  missing 
data.  For  example,  the  data  that  are  utilized  to  illustrate 
the  methods  in  this  paper  are  derived  from  a  nested 
case-control  study  of  serum  hormonal  levels  and 
breast  cancer  from  the  New  York  University  (NYU) 
Women’s  Health  Study  (5).  The  study  cohort  consists 
of  14,275  women  who  donated  multiple  blood  samples 
over  time  and  have  been  followed  since  enrollment  for 
the  development  of  breast  cancer.  Most  women  have 
donated  one  or  two  samples;  however,  many  have  also 
donated  three  or  more  samples.  Because  subjects  with 
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a  larger  number  of  multiple  blood  samples  have  a 
more  reliable  estimate  of  their  true  underlying  serum 
hormonal  levels  than  subjects  with  fewer  measure¬ 
ments,  the  reliability  of  the  measured  exposure  will 
not  be  constant  across  individuals.  Consequently,  the 
usual  procedure  for  correcting  for  measurement  error 
cannot  be  applied. 

Liu  and  Liang  (6)  proposed  an  estimating  equation 
approach  for  obtaining  consistent  estimates  of  logistic 
regression  parameters  when  all  subjects  have  the  same 
number  of  repeated  imprecise  exposure  measure¬ 
ments,  which  in  principle  could  be  extended  to  the 
more  complicated  situation  when  the  number  of  rep¬ 
licates  is  variable  between  subjects.  In  this  paper,  we 
discuss  an  alternative  method  for  correcting  for  mea¬ 
surement  error  in  the  analysis  of  matched  case-control 
data  when  subjects  have  a  variable  number  of  repeated 
exposure  measurements  and  the  individual’s  average 
is  used  as  the  measure  of  exposure.  The  technique, 
which  assumes  that  both  the  true  exposure  and  the 
measurement  error  are  normally  distributed,  involves 
multiplying  each  subject’s  observed  average  by  the 
reliability  of  the  average  prior  to  fitting  the  logistic 
regression  model.  The  resulting  logistic  regression  co¬ 
efficient  based  on  the  transformed  average  is  corrected 
for  measurement  error.  A  bootstrap  algorithm  for  ob¬ 
taining  confidence  intervals  for  the  regression  param¬ 
eter  which  takes  into  account  the  variability  due  to 
estimation  of  the  reliability  coefficient  is  also  pro¬ 
posed. 

METHODS 

Measurement  error  model  and  correction  of 
logistic  regression  parameter 

The  methods  described  below  are  based  on  the  mea¬ 
surement  error  model  of  Armstrong  et  al.  (7)  for 
matched  case-control  studies.  We  assume  that  in  each 
matching  stratum,  a  case  is  matched  to  a  variable 
number  of  controls.  However,  the  techniques  are  gen- 
eralizable  to  the  unmatch^  design  by  assuming  that 
there  is  only  one  matching.  Stratum. 

Let  Xijk  denote  the  unobserved  true  value  of  the 
exposure  variable  for  the  /dh  subject  with  case/control 
status  j  (0  =  control,  1  =  case),  in  stratum  /  (/  =  1, . . ., 
M).  Assume  that  is  normally  distributed  with 
mean,  /x,-  +  j8,  and  variance  o^.  In  addition,  let  Zijki 
denote  the  /th  observed  value  of  measured  with 
error,  for  /  =  1,  . . .,  n,;;*.  We  assume  the  following 
classical  errors-in- variables  model: 

Zijkl  Xjjk  "I"  €jjkii 

where  the  error  term,  is  independent  of  Xyk  and 
Cijkr,  for  /  ¥=  and  normally  distributed  with  mean  0 


and  variance,  o^.  It  follows  that  the  observed  Zijki  in 
stratum  i  are  normally  distributed  with  means  p,,-  -l-  8 
and  jx,  for  cases  and  controls,  respectively,  and  com¬ 
mon  variance,  +  cr^.  The  variance  component,  o^, 
can  be  interpreted  as  the  variance  of  the  true  exposure, 
after  stratifying  by  matching  stratum  and  case/control 
status,  and  as  the  variance  due  to  measurement 
error. 

With  these  assumptions  and  the  application  of 
Bayes’  rule,  Armstrong  et  al.  (7)  showed  that  the 
probability  that  a  study  subject  is  a  case,  conditional 
on  z„,  the  observed  average  based  on  n  measurements, 
and  membership  in  stratum  i,  is  a  logistic  function: 


Pr(D  =  llz„;  0  = 


expCa,  -I-  ^R„z„) 

1  -h  exp(a,  +  ^R„z,y 


(I) 


where 


R  -  — 


(2) 


is  the  reliability  of  z„  as  a  measure  of  x.  When  no 
measurement  error  is  present,  z„  =  x,  the  reliability 
coefficient  is  equal  to  1,  and  equation  1  reduces  to: 


Pr(D  =  I  lx;  i) 


exp(a,  +  jSx) 

1  exp(a,  +  fix) ' 


Thus,  an  estimate  of  the  logistic  regression  coefficient 
based  on  z„  will  estimate  the  “naive”  coefficient, 
J3*  =  PR„,  rather  than  the  true  j3.  Because  the  reliabil¬ 
ity  coefficient  is  between  0  and  1,  the  “naive”  /3*  will 
be  attenuated  relative  to  We  can  see  from  equation 
2,  however,  that  as  the  number  of  repeated  measure¬ 
ments  increases,  the  reliability  coefficient  approaches 
one,  and  the  corresponding  attenuation  in  /3  will  di¬ 
minish. 

When  all  subjects  have  the  same  number  of  n  re¬ 
peated  measurements,  an  estimate  of  the  true  regres¬ 
sion  coefficient  can  be  obtained  by  fitting  the  logistic 
model  using  z„  for  each  subject’s  exposure  measure¬ 
ment,  pd  multiplying  the  resulting  coefficient  esti¬ 
mate,  p*,  by  l/R„.  If  subjects  have  a  variable  number 
of  measurements,  however,  this  approach  cannot  be 
applied,  since  the  reliability  of  the  exposure  variable  is 
no  longer  constant  for  all  subjects,  but  depends  on  the 
number  of  available  repeated  measurements. 

For  the  case  where  the  reliability  of  the  exposure 
differs  across  subjects,  a  corrected  estimate  of  the 
regression  coefficient  may  be  obtained  by  multiplying 
each  subject’s  average  exposure  measurement  by  the 
reliability  of  the  average,  prior  to  model  fitting.  That 
is,  if  the  )tth  subject  in  stratum  i  has  the  observed 
average  Zijn: . ,  based  on  nyf.  approximate  measurements 
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of  Xij^,  then  replacing  the  unknown  in  the  condi¬ 
tional  logistic  model  with  the  transformed  average, 

■  *  where  R  is  calculated  from  equation  2,  will 
yield  an  estimate  of  the  true  /3.  Because  the  reliability 
increases  with  the  number  of  measurements,  this  trans¬ 
formation  results  in  greater  “shrinkage”  of  averages 
based  on  a  small  number  of  repeats  and  less  shrinkage 
of  more  informative  averages  based  on  many  repeats. 

When  all  subjects  within  the  same  matched  set  have 
the  same  number  of  repeats,  this  method  is  equivalent 
to  the  two-stage  approach  proposed  by  Thomas  et  al. 
(4)  and  Whittemore  (8)  for  error  correction  in  linear 
models,  in  which  Eix^^Zyk.)  is  computed  and  then 
used  as  the  exposure  in  fhe  usual  regression  model. 
Given  the  model  assumptions  described  above, 

E’i.Xy^Zijk.)  Rriijt^iJk-  “i”  (1  E{Xiji^ . 

If  n,y<.  is  constant  for  all  subjects  in  stratum  i,  the 
(1  -  R„JE(xij^)  term  is  absorbed  in  the  intercept  term 
and  does  not  affect  the  estimate  of  the  slope  parameter 
in  the  logistic  regression  model.  Thus,  utilizing 
E'(Xijk\^ijk  •  )  .  will  yield  equivalent  estimates 

of  the  true  regression  parameter.  Furthermore,  when 
all  subjects  in  the  study  have  the  same  number  of  n 
repeats,  this  technique  will  result  in  a  corrected  esti¬ 
mate  of  the  logistic  regression  coefficient  that  is  iden¬ 
tical  to  the  one  obtained  by  correcting  the  naive  esti¬ 
mate  by  !//?„. 

Although  fitting  the  logistic  model  to  the  trans¬ 
formed  covariate  will  result  in  an  unbiased  estimate  of 
/3,  the  corresponding  variance  of  ^  will  be  underesti¬ 
mated  unless  the  variance  components  in  the  reliability 
coefficient  are  known.  Usually,  however,  the  variance 
components  are  estimated  from  a  separate  reliability 
substudy  or  from  subjects  in  the  main  study.  In  our 
setting,  repeated  measurements  are  assumed  to  be 
available  on  all  or  a  subset  of  the  main  study  partici¬ 
pants.  Thus,  we  can  estimate  the  variance  components, 
and  of,  from  the  main  study  data  by  fitting  the 
following  mixed  effects  analysis  of  variance  model  to 
data  on  all  cases  and  controls: 

Zijki  =  +  8j  +  +  €ijki,  (3) 

where  Zyu  is  defined  as  before,  /x,-  is  the  effect  for 
stratum  i,  8j  is  the  effect  due  to  case/control  status,  yy/. 
is  a  normally  distributed  random  subject  effect  with 
mean  0  and  variance  o^,  and  ey^i  is  the  residual  error 
which  is  normally  distributed  with  mean  0  and  vari¬ 
ance  o^. 

The  variance  components,  and  o^,  can  be  esti¬ 
mated  using  one  of  several  methods,  including  tradi¬ 
tional  analysis  of  variance  (ANOVA),  maximum  like- 
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lihood  (ML)  or  restricted  maximum  likelihood 
(REML)  methods.  The  ANOVA  method,  available  in 
the  SAS  procedure,  PROC  GLM  (SAS  Institute,  Cary, 
North  Carolina),  was  used  in  our  example  because  it  is 
computationally  simpler  than  the  other  methods.  This 
is  an  important  consideration  when  implementing  the 
bootstrap  procedure  that  we  describe  in  the  next  sec¬ 
tion  for  generating  confidence  intervals.  However,  the 
ANOVA  method  can  lead  to  negative  variance  esti¬ 
mates.  The  ML  or  REML  estimators,  which  are  avail¬ 
able  from  PROC  MIXED  in  SAS,  do  not  have  this 
limitation.  For  further  details  about  the  different  esti¬ 
mation  techniques,  see  Searle  et  al.  (9). 

The  steps  involved  in  obtaining  an  estimate  of  the 
logistic  regression  coefficient  corrected  for  measure¬ 
ment  error  can  be  summarized  as  follows: 

1 .  Estimate  the  variance  components,  and  o^,  by 
fitting  the  mixed  effects  analysis  of  variance 
model  in  equation  3  to  the  study  data. 

2.  Multiply  each  subject’s  average  exposure  by 

3.  Estimate  the  true  logistic  regression  coefficient, 
/3,  by  fitting  a  conditional  logistic  regression 
model  to  the  transformed  averages. 

Because  the  technique  is  based  on  assuming  that  the 
true  exposure  and  measurement  error  are  normally 
distributed,  suitable  data  transformations  should  be 
applied  when  the  distributions  deviate  from  normality. 
Note,  however,  that  a  data  transformation  such  as  the 
log-transform  will  result  in  a  model  in  which  the  log 
odds  of  disease  is  a  linear  function  of  the  exposure 
measured  on  the  log  rather  than  the  original  scale. 


Bootstrap  method  for  obtaining  confidence 
intervals 

The  width  of  the  usual  95  percent  confidence  inter¬ 
val  for  the  true  j3  based  on  the  transformed  covariate 
will  be  too  narrow  because  the  interval  does  not  ac¬ 
count  for  the  extra  variability  associated  with  estima¬ 
tion  of  the  variance  components  in  R„.  Rosner  et  al.  (2) 
have  derived  the  asymptotic  variance  and  correspond¬ 
ing  confidence  intervtds  of  the  corrected  logistic  re¬ 
gression  parameter  which  includes  the  uncertainty  of 
the  variance  estimates  for  use  in  cohort  studies  under 
a  rare  disease  assumption.  Their  method,  however,  is 
applicable  only  when  all  subjects  in  the  main  study 
have  the  same  number  of  repeats.  For  the  situation 
when  subjects  in  a  matched  case-control  study  have  a 
variable  number  of  replicates,  we  propose  the  follow- 
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ing  bootstrap  algorithm  for  obtaining  confidence  in¬ 
tervals  for  the  true  /3: 

1.  Assuming  there  are  M  matched  sets  in  the  main 
study,  generate  a  bootstrap  sample  using  the 
matching  stratum  as  the  sampling  unit,  and  sam¬ 
pling  M  matched  sets  with  replacement  from  the 
main  study  data.  For  each  matched  set  that  is 
selected,  the  sample  contains  all  the  subjects 
within  the  matched  set,  along  with  each  subject’s 
case/control  status  and  repeated  measurements. 

2.  Using  the  bootstrap  sample,  estimate  o^,  o^,  and 
the  true  /3  by  following  the  three-step  approach 
outlined  in  the  previous  section. 

3.  Repeat  steps  1)  and  2)  1,000  times,  which  is  the 
approximate  minimum  number  of  bootstraps 
necessary  to  compute  bias-corrected  confidence 
limits  (10). 

In  constructing  a  bootstrap  sample  from  the  main 
study  data,  sampling  occurs  at  the  level  of  the  match¬ 
ing  stratum  because  the  matching  between  the  cases 
and  the  controls  needs  to  be  preserved.  If  the  number 
of  controls  matched  to  each  case  is  variable  across 
strata,  one  can  sample  the  strata  according  to  the 
number  of  subjects  in  each  matched  set,  in  order  to 
keep  the  total  sample  size  constant  for  each  bootstrap 
iteration.  For  example,  one  samples  with  replacement 
matched  sets  from  the  Afj  sets  in  the  main  study 
with  two  controls  per  case,  M3  sets  from  the  strata  with 
three  controls  per  case,  etc. 

The  simple  (1  —  a)%  confidence  interval  can  be 
constructed  using  the  a/2  and  (1  —  a/2)  percentiles  of 
the  bootstrap  distribution.  Bias-corrected  confidence 
intervals  should  be  used  when  the  bootstrap  distribu¬ 
tion  of  p  is  asymmetric  and  when  the  sample  size  is 
small  (10).  We  report  only  the  bias-corrected  confi¬ 
dence  intervals  in  this  paper. 

Thus  far,  our  focus  has  been  on  correcting  for  mea¬ 
surement  error  in  a  single  exposure  variable,  in  the 
absence  of  confounders.  However,  the  methods  can 
also  be  generalized  to  the  multi-covariate  situation, 
where  the  confounders,  in  addition  to  the  primary 
exposure  variable,  may'  be  measured  with  error.  A 
brief  outline  of  the  methods  is  given  in  the  Appendix. 
Additional  details  on  the  measurement  error  model 
and  estimation  of  variance  components  are  also  de¬ 
scribed  in  Armstrong  et  al.  (7). 

EXAMPLE 

The  primary  aim  of  the  NYU  Women’s  Health 
Study  is  to  determine  whether  serum  levels  of  endog¬ 
enous  hormones,  such  as  estradiol,  are  associated  with 
risk  of  breast  cancer.  Between  March  1985  and  June 
1991,  a  cohort  of  14,275  healthy  women  aged  34-65 


years  were  enrolled  at  the  Guttman  Breast  Diagnostic 
Institute  in  New  York  City.  At  the  time  of  enroll¬ 
ment  and  at  annual  screening  visits  thereafter,  women 
were  asked  to  donate  blood  and  complete  a  self- 
administered  questionnaire.  Serum  samples  were  fro¬ 
zen  and  stored  for  future  biologic  assays.  Subsequent 
cases  of  breast  cancer  were  identified  primarily 
through  active  follow-up  and  confirmed  by  reviewing 
medical  and  pathologic  records.  In  this  example,  only 
the  women  who  were  postmenopausal  at  enrollment 
(49  percent)  were  included. 

In  order  to  limit  the  costs  associated  with  measuring 
hormone  levels  in  the  cohort,  a  nested  case-control 
study  design  was  used.  For  each  incident  case  of  breast 
cancer,  individually  matched  controls  were  selected  at 
random  from  the  risk  set  consisting  of  all  cohort  mem¬ 
bers  alive  and  free  of  breast  cancer  at  the  time  of 
diagnosis  of  the  case,  and  who  matched  the  case  on 
menopausal  status  at  entiy,  age  at  entry,  and  number 
and  approximate  dates  of  blood  donations  up  to  the 
case’s  date  of  diagnosis.  For  additional  details  of  the 
study  design,  see  Toniolo  et  al.  (5). 

The  goal  of  this  example  is  to  evaluate  the  effect  of 
random  measurement  error  on  the  associations  be¬ 
tween  total  estradiol,  %  free  estradiol,  and  %  estradiol 
bound  to  sex  hormone-binding  globulin  (SHBG)  and 
risk  of  breast  cancer,  when  the  average  of  all  the 
available  repeated  measurements  for  a  subject  is  used 
as  her  exposure.  The  associations  between  the  baseline 
measurements  of  the  total  estradiol,  %  free  estradiol, 
and  %  estradiol  bound  to  SHBG  and  risk  of  breast 
cancer  among  postmenopausal  women,  unadjusted  for 
measurement  error,  were  evaluated  by  Toniolo  et  al. 
(11).  Total  estradiol  and  %  free  estradiol  were  found  to 
be  positively  associated  with  risk  of  breast  cancer, 
whereas  %  estradiol  bound  to  SHBG  had  a  strong 
protective  effect. 

One  of  the  assumptions  of  the  measurement  error 
model  is  that  the  true  and  observed  exposure  variables 
are  normally  distributed.  The  distribution  of  total  es¬ 
tradiol  levels  was  skewed,  so  the  logarithms  of  the 
values  were  used.  Based  on  data  from  both  postmeno¬ 
pausal  cases  and  controls,  we  estimated  the  reliability 
coefficients  for  total  estradiol,  %  free  estradiol,  and  % 
estradiol  bound  to  SHBG,  adjusted  for  matching  stra¬ 
tum  and  case/control  status,  as  0.48,  0.68,  and  0.92, 
respectively  (table  1).  (These  estimates  were  some¬ 
what  lower  than  the  estimates  published  by  Toniolo  et 
al.  (12),  0.51, 0.77,  and  0.94  for  total  estradiol,  %  free 
estradiol,  and  %  estradiol  bound  to  SHBG,  respec¬ 
tively,  based  on  data  from  only  the  postmenopausal 
controls  in  the  NYU  Women’s  Health  Study.)  The 
estimates  of  the  reliability  coefficients  indicate  that  the 
degree  of  measurement  error  in  total  estradiol  and  % 
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TABLE  1.  Reproducibility  of  total  estradiol,  %  free  estradiol 
and  %  estradiol  bound  to  sex  hormone-binding  globulin 
(SHBG),  adjusted  for  case/control  status  and  matching 
stratum  in  a  nested  case-control  study  of  serum  hormone 
levels  and  breast  cancer:  New  York  University  Women’s 
Health  Study  (5) 


Hormone 

Within-subject 

variance 

Between-subject 

variance 

Reliability 

coefficient 

Total  estradiol 

0.17 

0.16 

0.48 

%  free  estradiol 

0.017 

0.036 

0.68 

%  SHBG-bound 
estradiol 

9.38 

104.45 

0.92 

free  estradiol  may  be  sufficiently  large  to  attenuate 
observed  relations  with  risk  of  breast  cancer. 

The  main  case-control  study  sample  consisted  of 
381  subjects  stratified  into  130  matched  sets.  Ten 
matched  sets  had  one  control  per  case,  119  sets  had 
two  controls  per  case,  and  one  set  had  three  controls 
per  case.  Of  the  381  subjects  in  the  main  study,  157 
(41  percent)  had  two  or  more  repeated  measurements, 
i.e.,  98  subjects  had  two  replicates,  53  had  three  rep¬ 
licates,  and  six  subjects  had  four  replicates. 

We  investigated  the  effects  of  measurement  error  on 
the  observed  associations  between  each  exposure  vari¬ 
able  and  risk  of  breast  cancer  by  comparing  the  esti¬ 
mated  logistic  regression  parameters  based  on  the  first 
measurement  of  the  exposure  for  each  subject,  the 
average  of  the  replicate  measures,  and  the  transformed 
(corrected)  average  value.  Corresponding  odds  ratios 
were  calculated  from  the  regression  estimates  by  com¬ 
paring  women  in  the  90th  versus  10th  percentiles  of 
the  observed  distributions  (i.e.,  63.0  vs.  14.5  for  total 
estradiol,  1.7  vs.  1.04  for  %  free  estradiol,  and  57.6  vs. 
27.3  for  %  estradiol  bound  to  SHBG). 

The  bootstrap  confidence  intervals  were  generated 
using  the  SAS  macro  facility  to  create  the  bootstrap 
sample,  in  conjunction  with  PROC  PHREG,  which  fits 
conditional  logistic  regression  models.  All  analyses 
were  run  on  a  DEC  30(X)/700  AXP  computer  work¬ 
station  (Digital  Equipment  .Corporation,  Maynard, 
Massachusetts).  •  . 

The  results  are  shown  in  table  2.  For  total  estradiol 
and  %  free  estradiol,  the  uncorrected  analyses  show 
that  using  the  observed  average  of  the  repeated  mea¬ 
surements  results  in  a  minor  increase  in  the  regression 
coefficient  estimates  compared  with  using  only  the 
baseline  measurement.  On  the  other  hand,  the  esti¬ 
mated  regression  coefficients  corrected  for  measure¬ 
ment  error  using  the  transformed  averages  are  substan¬ 
tially  larger  than  the  estimates  based  on  the  observed 
averages  for  both  variables:  increases  are  74  percent 
and  40  percent  for  total  estradiol  and  %  free  estradiol, 
respectively. 


TABLE  2.  Corrected  and  uncorrected  logistic  regression 
parameter  estimates,  confidence  intervals,  and  odds  ratios 
for  the  associations  of  total  estradiol,  %  free  estradiol  and  % 
estradiol  bound  to  sex  hormone-binding  globulin  (SHBG)  with 
risk  of  breast  cancer  in  a  nested  case-control  study  of  serum 
hormonal  levels  and  breast  cancer:  New  York  University 
Women’s  Health  Study  (5) 


Exposure  variable 

Regression 

coefficient 

95%  confidence 
interval* 

Odds 

ratiot 

Total  estradiolt 

First  measurement 

0.66 

0.24  to  1.09 

2.64 

Average 

0.77 

0.32  to  1.22 

3.10 

Transformed  average 

1.34 

0.61  to  2.47 

7.16 

%  free  estradiol 

First  measurement 

1.70 

0.69  to  2.71 

3.07 

Average 

1.73 

0.70  to  2.77 

3.13 

Transformed  average 

2.42 

1.06  to  4.00 

4.95 

%  SHBG-bound 
estradiol 

First  measurement 

-0.046 

-0.068  to  -0.024 

0.25 

Average 

-0.045 

-0.067  to  -0.023 

0.26 

Transformed  average 

-0.048 

-0.074  to  -0.025 

0.24 

*  95%  confidence  intervals  using  transformed  average  based  on 
bias-corrected  bootstrap  estimate. 

t  Comparing  women  at  90th  vs.  10th  percentile  of  observed 
distribution. 

T  Total  estradiol  measurements  were  log-transformed. 


The  effect  of  measurement  error  on  the  estimated 
odds  ratios  is  especially  striking.  When  comparing 
women  in  the  90(h  percentile  versus  the  10th  percen¬ 
tile  of  the  observed  total  estradiol  distribution,  the 
corrected  odds  ratio  was  estimated  to  be  7.16,  com¬ 
pared  with  uncorrected  odds  ratios  of  2.64  and  3.10 
using  the  baseline  and  untransformed  average,  respec¬ 
tively.  Similarly,  the  corrected  odds  ratio  for  %  free 
estradiol  was  4.95,  compared  with  3.07  for  the  base¬ 
line  measurement  and  3.13  for  the  average  value. 

This  illustrates  how  using  the  observed  average  of 
replicate  measurements  of  exposure  for  each  subject 
may  not  be  sufficient  to  offset  the  effects  of  measure¬ 
ment  error  when  the  degree  of  error  is  large  and  when 
subjects  have  only  a  few  replicates,  and  that  additional 
error  correction  procedures  may  be  necessary.  In  the 
case  of  total  estradiol,  one  would  need  to  take  the 
average  of  10  replicate  measurements  to  improve  the 
reliability  to  0.90,  based  on  the  estimated  variance 
components  in  table  1.  For  %  free  estradiol,  one  would 
need  five  measurements.  Thus,  it  is  not  surprising  that 
using  the  average  value  in  our  example  did  not  appre¬ 
ciably  deattenuate  the  corresponding  regression  coef¬ 
ficient,  because  only  41  percent  of  the  study  subjects 
had  replicate  measurements,  and,  among  these,  most 
had  only  two  or  three  measurements.  Using  the  aver¬ 
age  resulted  in  a  17  percent  increase  in  the  regression 
coefficient  for  total  estradiol,  relative  to  using  the  first 
measurement.  In  comparison,  if  all  subjects  had  two 
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replicates,  the  expected  increase  in  the  regression  co¬ 
efficient  would  be 

(/?2  -  RiVRi  =  (0.65  -  0.48)/0.48  =  37% 

over  the  estimate  based  on  one  measurement.  On  the 
other  hand,  because  levels  of  %  estradiol  bound  to 
SHBG  are  highly  reproducible,  the  logistic  regression 
estimates  and  corresponding  odds  ratios  using  the  cor¬ 
rected  average  were  not  very  different  from  the  uncor¬ 
rected  analyses. 

Because  1 19  (92%)  of  the  130  matched  sets  had  two 
controls  matched  per  case,  implementation  of  a  more 
complex  stratified  bootstrap  sampling  scheme,  which 
would  keep  the  total  number  of  subjects  constant  for 
each  iteration,  was  not  warranted.  As  one  would  ex¬ 
pect,  the  bias-corrected  bootstrap  confidence  intervals 
based  on  the  transformed  average,  as  shown  in  table  2, 
are  shifted  further  away  from  0  and  are  wider  than  the 
uncorrected  confidence  intervals  for  all  variables,  be¬ 
cause  the  bootstrap  method  accounts  for  the  variation 
due  to  estimation  of  the  variance  components  in  the 
reliability  coefficient.  When  the  variation  in  the  esti¬ 
mates  of  variance  components  estimates  was  ignored, 
the  simple  95  percent  confidence  intervals  based  on 
the  corrected  average  were  estimated  to  be  0.54  to 
2.13,  1.02  to  3.82,  and  ~0.074  to  —0.025  for  total 
estradiol,  %  free  estradiol,  and  %  estradiol  bound  to 
SHBG,  respectively.  Thus,  ignoring  the  extra  source  of 
variation  from  R„^^^  underestimated  the  width  of  the  con¬ 
fidence  interval  by  as  much  as  17  percent  (for  total 
estradiol)  in  our  data  set. 


DISCUSSION 

In  most  reliability  studies,  the  within-subject  or 
error  variance  of  the  exposure  is  estimated  from  an 
external  population  or  from  a  random  subset  of  the 
main  study  population  from  whom  repeated  measure¬ 
ments  are  obtained,  and  one  must  assume  that  the 
resulting  estimate  is  generalizable  to  the  main  study 
population.  In  our  example,  the  within-subject  vari¬ 
ances  were  estimated  from  the  subjects  in  the  main 
study  with  at  least  two  repeated  hormone  measure¬ 
ments.  Women  with  repeated  measurements,  however, 
may  differ  from  women  with  only  one  measurement. 
Because  blood  samples  in  the  NYU  Women’s  Health 
Study  were  obtained  at  annual  breast  cancer  screening 
visits,  women  with  a  family  history  of  breast  cancer, 
for  example,  or  those  who  are  more  health  conscious, 
may  have  been  more  likely  to  return  for  subsequent 
visits.  It  is  unlikely,  though,  that  this  would  result  in  a 
systematic  difference  in  the  within-subject  variability 
of  the  hormone  levels  between  the  subset  with  repeats 


and  those  who  had  only  one  measurement.  Thus,  we 
can  assume  generalizability  of  the  estimated  within- 
subject  variance  to  all  subjects  in  our  main  study. 

A  second  assumption  of  our  error-correction  method 
is  that  a  subject’s  measurements  are  distributed  ran¬ 
domly  around  the  unobserved  true  value,  and  that 
levels  of  the  exposure  are  not  changing  systematically 
over  time.  This  assumption  may  not  be  true  if  hormone 
levels  decrease  with  age.  In  addition,  for  breast  cancer 
cases,  hormone  levels  could  be  influenced  by  the  de¬ 
velopment  of  disease  so  that  measurements  obtained 
closer  to  the  date  of  diagnosis  may  exhibit  a  systematic 
time  trend.  Among  subjects  in  the  NYU  Women’s 
Health  Study,  however,  a  trend  in  estradiol  levels  over 
time  was  not  observed  in  preliminary  analyses  using 
linear  regression  techniques  (results  not  shown). 

We  have  also  assumed  that  the  variance  compo¬ 
nents,  and  of,  are  homogeneous  across  strata  and 
case/control  status.  The  within-subject  variance  for 
total  estradiol  was  estimated  as  0.16  and  0.18  for  cases 
and  controls,  respectively,  indicating  that  the  error 
variances  are  similar  for  the  two  groups.  Because  only 
one  case  was  included  in  each  stratum,  we  could  not 
evaluate  whether  of  was  constant  for  cases  and  con¬ 
trols.  Furthermore,  assessing  whether  of  was  homo¬ 
geneous  across  strata  was  not  possible,  given  that  most 
strata  only  had  two  controls. 

The  error-correction  methods  in  this  paper  are  ap¬ 
plicable  to  studies  in  which  a  variable  number  of 
repeated  measurements  of  exposure  are  obtained  on 
subjects,  and  the  average  of  each  subject’s  measure¬ 
ments  is  used  as  the  exposure  variable.  In  principle,  a 
corrected  estimate  of  the  logistic  regression  coefficient 
could  also  be  obtained  by  utilizing  only  the  first  mea¬ 
surement  of  exposure  for  each  subject,  and  correcting 
the  resulting  estimate  by  the  reliability  of  a  single 
measure.  Although  this  method  is  much  simpler  than 
using  all  the  available  repeated  measurements  and 
applying  the  techniques  proposed  in  this  paper,  the 
estimate  based  on  a  single  measurement  will  not  as 
efficient.  For  example,  the  95  percent  bias-corrected 
bootstrap  confidence  interval  for  the  true  /3  using  only 
the  first  measurements  of  total  estradiol  was  0.52- 
2.73,  which  is  wider  by  19  percent  than  the  corre¬ 
sponding  interval  based  on  the  transformed  averages. 

Haukka  (13)  proposed  a  similar  bootstrap  method 
for  correcting  for  measurement  error  in  generalized 
linear  models  for  the  situation  when  the  “gold  stan¬ 
dard’’  is  known  for  the  exposure  measurement  and 
when  validation  (as  opposed  to  reproducibility)  data 
are  available.  When  compared  with  the  correction 
method  for  logistic  regression  proposed  by  Rosner  et 
al.  ( 14),  which  also  takes  into  account  the  variability  in 
R,  the  bootstrap  method  was  found  to  yield  wider 
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confidence  intervals  .for  peaked  and  skewed  measure¬ 
ment  error  distributions.  As  discussed  by  Haukka  (13), 
this  difference  may  result  because  the  bootstrap 
method  takes  better  account  of  the  measurement  error 
variance,  whereas  the  Rosner  et  al.  method  (14)  is 
based  on  a  first-order  Taylor  series  approximation, 
which  may  not  adequately  correct  confidence  intervals 
when  the  error  variance  is  large. 

We  have  shown  that  in  situations  when  the  magni¬ 
tude  of  measurement  error  is  large  and  subjects  have 
only  a  few  repeats,  using  the  average  of  the  available 
replicate  measurements  for  each  subject  may  not  be 
sufficient  to  adjust  for  the  measurement  error.  The 
methods  proposed  in  this  paper  can  be  applied  to 
provide  additional  correction  procedures  in  the  analy¬ 
sis  of  case-control  data  where  subjects  have  a  variable 
number  of  repeated  measures  of  the  exposure.  The 
advantage  of  our  algorithm  is  that  it  is  conceptually 
straightforward  and  relatively  easy  to  implement,  es¬ 
pecially  with  the  amount  of  computing  power  that  is 
now  readily  available  to  most  investigators. 


ACKNOWLEDGMENTS 

This  work  was  supported  by  grant  no.  DAMD17-94-J- 
4174  from  the  Department  of  Defense  Breast  Cancer  Re¬ 
search  Program  and  grant  nos.  CA- 16087  and  CA-34588 
from  the  National  Institutes  of  Health. 

The  authors  are  grateful  to  Drs.  Alice  Whittemore  and 
Stuart  Lipsitz  for  helpful  comments  on  the  manuscript. 


REFERENCES 

1.  De  Klerk  NH,  English  DR,  Armstrong  BK.  A  review  of  the 
effects  of  random  measurement  error  on  relative  risk  estimates 
in  epidemiological  studies.  Int  J  Epidemiol  1989;18:705-12. 

2.  Rosner  B,  Spiegelman  D,  Willett  WC,  Correction  of  logistic 
regression  relative  risk  estimates  and  confidence  intervals  for 
random  within-person  measurement  error.  Am  J  Epidemiol 
1992;136:1400-13. 

3.  Armstrong  BG.  The  effects  of  measurement  errors  on  relative 
risk  regressions.  Am  J  Epidemiol  1990;132:1176-84. 

4.  Thomas  D,  Stram  D,  Dwyer  J.  Exposure  measurement  error: 
influence  on  exposure-disease  relationships  and  methods  of 
correction.  Annu  Rev  Public  Health  1993;14:69-93. 

5.  Toniolo  P,  Pasternack  B,  Shore  R,  et  al.  Endogenous  hor¬ 
mones  and  breast  cancer:  a  prospective  cohort  study.  Breast 
Cancer  Res  Treat  1991;18:S23-S26. 

6.  Liu  X,  Liang  KY.  Efficacy  of  repeated  measures  in  regression 
models  with  measurement  error.  Biometrics  1992;48:645-54. 

7.  Armstrong  B,  Whittemore  A,  Howe  G.  Analysis  of  case- 
control  data  with  covariate  measurement  error:  application  to 
diet  and  colon  cancer.  Stat  Med  1989;8:1151-63. 

8.  Whittemore  AS.  Errors-in-variables  regression  using  Stein 
estimates.  Am  Statistician  1989;43:226-8. 

9.  Searle  SR,  Casella  G,  McCulloch  CE.  Variance  components. 
New  York:  John  Wiley  and  Sons,  1992. 

10.  Efron  B,  Tibshirani  R.  Bootstrap  methods  for  standard  errors. 


confidence  intervals,  and  other  measures  of  statistical  accu¬ 
racy.  Stat  Sci  1986;1:54-77. 

1 1.  Toniolo  P,  Levitz  M,  Zeleniuch-Jacquotte  A,  et  al.  A  prospec¬ 
tive  study  of  endogenous  estrogens  and  breast  cancer  in  post¬ 
menopausal  women.  J  Natl  Cancer  Inst  1995;87:190-7. 

12.  Toniolo  P,  Koenig  K,  Pasternack  B,  et  al.  Reliability  of 
measurements  of  total,  protein  bound,  and  unbound  estradiol 
in  serum.  Cancer  Epidemiol  Biomarkers  Prev  1994;3:47-70. 

13.  Haukka  JK.  Correction  for  covariate  measurement  error  in 
generalized  linear  models — a  bootstrap  approach.  Biometrics 
1995;51:1127-32. 

14.  Rosner  B,  Spiegelman  D,  Willett  WC.  Correction  of  logistic 
regression  relative  risk  estimates  and  confidence  intervals  for 
measurement  error:  the  case  of  multiple  covariates  measured 
with  error.  Am  J  Epidemiol  1990;132:734-45. 

15.  Kim  MY,  Pasternack  BP,  Carroll  RJ,  et  al.  Estimating  the 
reliability  of  an  exposure  variable  in  the  presence  of  confound- 
ers.  Stat  Med  1995;14:1437-46. 


APPENDIX 

In  order  to  generalize  the  techniques  to  the  multi¬ 
variate  situation,  assume  that  x^j  denotes  a  (p  X  1) 
vector  of  true  covariates  for  the  jih  subject  in  stratum 
/,  and  that  it  follows  a  multivariate  normal  distribution 
with  mean  vector  +  A  for  the  cases  and  for  the 
controls,  and  covariance  matrix  2.  In  addition,  let 

^ijk  ^ijk 

denote  the  kxh  observed  measurement  of  fork—  1, 
. . .,  rip  where  the  are  independent  and  identically 
distributed  according  to  a  multivariate  normal  distri¬ 
bution  with  covariance  matrix,  fl. 

Under  these  assumptions,  Armstrong  et  al.  (7) 
showed  that  the  probability  that  a  subject  is  a  case, 
conditional  on  the  mean  of  n  repeated  observations  of 
the  covariate  vectors,  4,  is  equal  to  the  following 
logistic  function: 


Pr(D=llz„.i)  = 


exp(a,  +  z„A„j3) 

1  +  exp(a,  +  z„A„j3)’ 


where 


z„=  A„  =  (E  +  n  'H)  '  '2, 

and  /3  is  the  (p  X  1)  vector  of  logistic  regression 
parameters. 

When  subjects  have  a  variable  number  of  replicate 
measures  of  the  exposure  variables,  it  follows  that  as 
in  the  single  covariate  case,  one  can  transform  the 
observed  mean  covariate  vector  for  each  subject  by 
multiplying  the  vector  by  an  estimate  of  the  matrix, 
A„ ,  and  then  fitting  the  usual  logistic  regression 
mdilel  to  the  transformed  covariates  to  obtain  the 
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corrected  logistic  regression- coefficients  for  all  covari¬ 
ates.  A  bootstrap  algorithm  analogous  to  that  for  the 
single  covariate  case  could  be  used  to  obtain  corrected 
confidence  intervals  which  take  into  account  the  vari¬ 
ation  due  to  estimation  of  A„  ,  but  the  method  could 
become  very  computationally"  intensive  with  a  large 
number  of  covariates,  since  more  complicated  multi¬ 


variate  analysis  of  variance  (MANOVA)  models 
would  be  needed  to  estimate  S  and  ft.  For  the  special 
case  when  only  a  single  covariate  is  measured  with 
error  and  the  others  are  measured  without  error,  how¬ 
ever,  estimation  of  the  variance  components  is  greatly 
simplified  (see  Kim  et  al.  (15)),  and  the  bootstrap 
method  can  be  more  easily  applied. 
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The  authors  examined  the  relation  between  postmenopausal  serum  levels  of  testosterone  and  dehydro- 
epiandrosterone  sulfate  (DHEAS)  and  subsequent  risk  of  breast  cancer  in  a  case-control  study  nested  within 
the  New  York  University  Women’s  Health  Study  cohort.  A  specific  objective  of  their  analysis  was  to  examine 
whether  androgens  had  an  effect  on  breast  cancer  risk  independent  of  their  effect  on  the  biologic  availability 
of  estrogen.  A  total  of  130  cases  of  breast  cancer  were  diagnosed  prior  to  1991  in  a  cohort  of  7,054 
postmenopausal  women  who  had  donated  blood  and  completed  questionnaires  at  a  breast  cancer  screening 
clinic  in  New  York  City  between  1985  and  1991 .  For  each  case,  two  controls  were  selected,  matching  the  case 
on  age  at  blood  donation  and  length  of  storage  of  serum  specimens.  Biochemical  analyses  were  performed 
on  sera  that  had  been  stored  at  -80°C  since  sampling.  The  present  report  includes  a  subset  of  85  matched 
sets,  for  whom  at  least  6  months  had  elapsed  between  blood  donation  and  diagnosis  of  the  case.  In  univariate 
analysis,  testosterone  was  positively  associated  with  breast  cancer  risk  (odds  ratio  (OR)  for  the  highest 
quartile  =  2.7,  95%  confidence  interval  (Cl)  1. 1-6.8,  p  <  0.05,  test  for  trend).  However,  after  including  % 
estradiol  bound  to  sex  hormone-binding  globulin  (SHBG)  and  total  estradiol  in  the  statistical  model,  the  odds 
ratios  associated  with  higher  levels  of  testosterone  were  considerably  reduced,  and  there  was  no  longer  a 
significant  trend  (OR  for  the  highest  quartile  =  1 .2,  95%  Cl  0.4-3.5).  Conversely,  breast  cancer  risk  remained 
positively  associated  with  total  estradiol  levels  (OR  for  the  highest  quartile  =  2.9,  95%  Cl  1. 0-8.3)  and 
negatively  associated  with  %  estradiol  bound  to  SHBG  (OR  for  the  highest  quartile  =  0.05, 95%  Cl  0.01-0.19) 
after  adjustment  for  serum  testosterone  levels.  These  results  are  consistent  with  the  hypothesis  that  testos¬ 
terone  has  an  indirect  effect  on  breast  cancer  risk,  via  its  influence  on  the  amount  of  bioavailable  estrogen.  No 
evidence  was  found  of  an  association  between  DHEAS  and  risk  of  breast  cancer  in  postmenopausal  women. 
Am  J  Epidemiol  1997;145;1030-8. 
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A  possible  role  of  androgens  in  the  development  of 
breast  cancer  in  postmenopausal  women  was  first  sug¬ 
gested  by  Grattarola  et  al.  (1).  Mechanisms  by  which 
androgens  may  increase  breast  cancer  risk  were  re¬ 
viewed  by  Secreto  et  al.  (2,  3)  and  Bernstein  and  Ross 
(4).  Androgens  may  act  directly,  by  stimulating  breast 
cell  proliferation  through  binding  to  androgen  recep¬ 
tors  or  by  stimulating  the  synthesis  of  growth  factors 
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inside  the  breast  epithelium.  Androgens  may  also  act 
indirectly  through  their  conversion  to  estrogens,  which 
are  known  to  stimulate  breast  cell  proliferation  (5): 
aromatization  of  androstenedione  and  testosterone  in 
peripheral  tissues  is  the  main  source  of  estrogens  in 
postmenopausal  women.  In  addition,  it  is  well  estab¬ 
lished  that  testosterone  binds  to  sex  hormone-binding 
globulin  (SHBG)  with  greater  affinity  than  estradiol. 
Testosterone  may  thus  indirectly  increase  the  risk  of 
breast  cancer  by  decreasing  the  fraction  of  estradiol 
bound  to  SHBG  and  thereby  increasing  the  nonbound 
fraction,  which  is  thought  to  be  the  fraction  available 
to  breast  cells  (6).  Finally,  it  has  been  suggested  that 
testosterone  inhibits  hepatic  secretion  of  SHBG  (7), 
which  could  also  result  in  a  decreased  fraction  of 
estradiol  bound  to  SHBG. 

Several  case-control  studies  have  reported  on  the 
association  of  plasma  or  serum  levels  of  testosterone 
with  risk  of  breast  cancer  in  postmenopausal  women. 
Most  such  studies  (2,  8-11,  Bruning  et  al.,  unpub- 
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lished  data)  although  not  all  studies  (12)  observed 
higher  levels  of  testosterone  in  cases  than  in  controls. 
Among  the  three  prospective  studies  which  examined 
the  relation  of  serum  levels  of  testosterone  with  risk  of 
postmenopausal  breast  cancer,  one  found  a  significant 
positive  association  (13),  one  found  a  nonsignificant 
positive  association  (14),  and  the  third  reported  no 
association  (15). 

Dehydroepiandrosterone  (DHEA)  is  the  androgen 
produced  by  the  adrenal  in  largest  quantity.  The  phys¬ 
iologic  roles  of  DHEA  and  of  its  sulfate  (DHEAS), 
which  is  thought  to  be  produced  exclusively  by  the 
adrenal  cortex  (16),  are  unknown.  They  are  considered 
weak  androgens,  but  also  appear  to  have  estrogenic 
properties  (17).  It  has  been  proposed  that  DHEA  and 
DHEAS  protect  against  breast  cancer  in  premeno¬ 
pausal  women,  but  increase  breast  cancer  risk  in  post¬ 
menopausal  women  (17, 18).  These  conflicting  actions 
could  be  reconciled  by  a  recent  hypothesis:  in  pre¬ 
menopausal  women,  DHEA  would  have  an  antiestro¬ 
genic  effect  by  binding  competitively  to  estrogen  re¬ 
ceptors,  whereas,  in  postmenopausal  women,  DHEA 
would  bind  to  vacant  estrogen  receptors  and  enhance 
estradiol-like  effects,  thereby  stimulating  tumor 
growth  (19). 

Results  from  case-control  studies  of  DHEA  and 
DHEAS  conducted  in  postmenopausal  women  have 
been  mixed  (2,  20-22).  The  three  prospective  cohort 
studies  which  examined  the  relation  of  testosterone 
with  breast  cancer  risk  in  postmenopausal  women  also 
measured  DHEAS;  one  study  (23)  reported  no  associ¬ 
ation,  whereas  the  two  others  (13,  24)  observed  a 
nonsignificant  positive  association. 

We  report  here  on  the  relation  between  postmeno¬ 
pausal  serum  levels  of  testosterone  and  DHEAS  and 
subsequent  risk  of  breast  cancer  in  a  case-control  study 
nested  within  a  prospective  cohort,  the  New  York 
University  (NYU)  Women’s  Health  Study.  We  previ¬ 
ously  reported  a  positive  association  between  post¬ 
menopausal  serum  fractions  of  bioavailable  estrogens 
and  risk  of  breast  cancer 'in  this  study  population  (25). 
A  specific  objective  of  our  analysis  was  to  examine 
whether  serum  levels  of  androgens  have  an  effect  on 
breast  cancer  risk  independent  of  their  influence  on 
serum  levels  and  biologic  availability  of  estrogens. 

MATERIALS  AND  METHODS 

The  NYU  Women’s  Health  Study  cohort 

Between  March  1985  and  June  1991,  the  NYU 
Women’s  Health  Study  enrolled  a  cohort  of  14,275 
women  aged  34-65  years  at  the  Guttman  Breast  Di¬ 
agnostic  Institute,  a  breast  cancer  screening  center  in 
New  York  City.  Details  concerning  subject  recruit¬ 


ment  have  been  published  elsewhere  (25,  26).  The 
current  report  is  limited  to  the  7,054  cohort  members 
who  were  postmenopausal  at  the  time  of  enrollment. 
Participants  were  classified  as  postmenopausal  if  they 
reported  either:  1)  no  menstrual  cycles  during  the 
preceding  6  months,  2)  a  total  bilateral  oophorectomy, 
or  3)  a  hysterectomy  without  complete  oophorectomy 
prior  to  natural  menopause  and  were  52  years  of  age  or 
older.  Cohort  members  donated  30  mL  of  blood  and 
completed  a  self-administered  questionnaire  at  enroll¬ 
ment.  Blood  was  drawn  prior  to  breast  examination, 
between  9:00  A.M.  and  3:00  P.M.  in  nonfasting 
women.  After  centrifugation,  serum  samples  were  im¬ 
mediately  stored  at  -80°C  for  subsequent  biochemi¬ 
cal  analyses.  Women  who  had  taken  hormonal  medi¬ 
cations  in  the  6  months  preceding  their  visit  were  not 
eligible. 

Nested  case-control  study 

Cases  of  breast  adenocarcinoma  were  identified  pri¬ 
marily  through  active  follow-up  of  the  cohort  and  were 
confirmed  by  review  of  individual  clinical  and  pathol¬ 
ogy  records  (25).  For  each  case  diagnosed  in  a  woman 
who  was  postmenopausal  at  enrollment,  two  controls 
were  selected  at  random  from  the  risk  set  of  women 
who  were  alive  and  free  of  disease  at  the  time  of 
diagnosis  of  the  case,  and  who  matched  the  case  on 
age  at  enrollment  (±6  months),  date  of  initial  blood 
donation  (±3  months),  and  menopausal  status.  As  of 
October  1991,  130  members  of  the  postmenopausal 
cohort  had  been  identified  who  had  received  a  diag¬ 
nosis  of  breast  cancer  prior  to  January  1,  1991.  Serum 
assays  of  follicle-stimulating  hormone  (FSH)  were 
conducted  to  confirm  the  postmenopausal  status  of  all 
the  cases  and  their  selected  controls:  three  controls, 
who  had  reported  the  absence  of  menstrual  cycles  in 
the  6  months  prior  to  enrollment,  had  FSH  levels 
below  17.5  lUAiter,  which  was  less  than  the  minimal 
level  compatible  with  postmenopausal  status  for  our 
assay.  They  were  nonetheless  included  in  the  analysis, 
since  excluding  them  did  not  materially  affect  risk 
estimates.  Estrogen  assays  (total  estradiol,  %  estradiol 
free,  and  %  estradiol  bound  to  SHBG)  were  performed 
for  all  matched  sets.  For  logistical  reasons,  androgen 
assays  were  carried  out  in  a  subset  of  1 1 8  matched 
sets.  Excluded  from  the  analyses  reported  here  are  33 
matched  sets  for  whom  diagnosis  of  the  case  occurred 
6  months  or  less  after  blood  donation,  six  controls  who 
reported  treatment  with  corticosteroids  in  the  6  months 
prior  to  blood  donation,  and  one  control  whose  estro¬ 
gen  assays  were  done  on  a  different  day  than  the 
matching  case.  As  a  result,  85  cases  (83  invasive  and 
2  noninvasive  intraductal)  and  163  controls  are  in¬ 
cluded  in  the  present  report. 
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Laboratory  methods 

For  androgen  assays,  serum  samples  that  had  not 
been  previously  defrosted  were  shipped  in  dry  ice  to 
the  Netherlands  Cancer  Institute  and  analyzed  in  two 
batches.  Samples  from  a  case  and  her  matched  con¬ 
trols  were  always  analyzed  in  the  same  batch.  All 
assays  were  performed  in  duplicate  with  the  laboratory 
personnel  blinded  to  the  case  or  control  status  of  the 
samples.  Reference  sera  were  included  for  each  assay 
in  several  places  within  each  batch. 

Total  testosterone  was  measured  by  a  solid-phase 
radioimmunoassay  (Coat-A-Count,  Diagnostic  Prod¬ 
ucts  Corp.,  Los  Angeles,  California)  not  requiring 
extraction  or  chromatography.  The  mean  intra-assay 
coefficient  of  variation  in  the  range  of  measurement 
was  6.2  percent.  The  inter-assay  coefficients  of  vari¬ 
ation  were,  respectively,  11  percent  at  1.67,  2.2  per¬ 
cent  at  10.00,  and  7.0  percent  at  22.05  nmol/liter. 

DHEAS  was  measured  directly  in  diluted  serum  as 
we  have  reported  previously  (27)  using  an  antiserum 
against  DHEA  which  showed  a  42  percent  crossreac¬ 
tivity  with  DHEAS.  As  DHEA  was  present  in  serum  in 
concentrations  at  least  10  times  lower  than  DHEAS,  it 
had  a  negligible  influence  on  the  DHEAS  values, 
which  were  read  from  a  DHEAS  standard  plot.  The 
mean  intra-assay  coefficient  of  variation  in  the  range 
of  measurement  was  3.3  percent.  The  inter-assay  co¬ 
efficients  of  variation  were,  respectively,  10  percent  at 
1.68,  9  percent  at  2.97,  10  percent  at  5.51,  and  8 
percent  at  15.18  jumol/liter. 

Total  estradiol  was  measured  by  standard  radioim¬ 
munoassay  (Pantex,  Inc.,  Santa  Monica,  California). 
Percent  estradiol  bound  to  SHBG  and  %  estradiol  free 
were  measured  with  a  concanavalin  A-sepharose  bind¬ 
ing  and  an  ultrafiltration  method,  respectively,  as  re¬ 
ported  previously  (25). 

Statistical  methods 

When  treated  as  continuous,  total  estradiol,  testos¬ 
terone,  and  DHEAS  were  logg-transformed  to  reduce 
departures  from  the  normal  distribution.  The  paired  t 
test  was  used  to  compare  hormone  levels  of  the  cases 
to  the  mean  hormone  levels  of  their  matched  controls. 

To  compute  odds  ratios,  hormonal  measurements 
were  categorized  into  quartiles,  using  the  frequency 
distribution  of  the  cases  and  the  controls  combined. 
Because  the  androgen  assays  were  performed  in  two 
batches,  quartile  cut-points  were  calculated  separately 
for  each  batch.  The  weighted  averages  of  the  cut- 
points  are  reported  in  the  tables. 

The  data  were  analyzed  using  conditional  logistic 
regression  (28).  Odds  ratios  were  computed  relative  to 
the  lowest  quartile.  Regression  analyses  were  also 


performed  on  the  continuous  hormonal  variables. 
Likelihood  ratio  tests  were  used  to  assess  the  statistical 
significance  of  overall  associations,  linear  trends,  and 
deviations  from  linearity.  All  p  values  are  two-sided. 

One  objective  of  the  analysis  was  to  examine  con¬ 
currently  the  effect  of  androgens  and  estrogens.  There¬ 
fore,  we  report  on  the  effect  of  adding  androgen  vari¬ 
ables  to  models  containing  estrogen  variables,  and 
vice-versa.  When  adding  estrogen  variables  to  models 
containing  androgen  variables,  %  SHBG-bound  estro¬ 
gen  was  entered  first  because  it  was  the  estrogen 
variable  most  strongly  associated  with  breast  cancer 
risk  in  multivariate  models  (25). 

Hormone  levels  in  this  study  were  assessed  from  a 
single  blood  donation.  For  some  hormones,  however,  a 
single  measurement  may  not  provide  a  reliable  esti¬ 
mate  of  a  woman’s  long-term  average  level,  the  expo¬ 
sure  of  interest,  because  of  intrinsic  fluctuations  in  the 
hormone  over  time  and  laboratory  measurement  error. 
In  addition,  different  hormones  are  measured  with 
varying  amounts  of  error.  For  example,  the  reliability 
coefficients  of  total  estradiol,  %  estradiol  bound  to 
SHBG,  and  DHEAS  were  estimated  to  be  0.51,  0.94, 
and  0.75,  respectively,  in  our  study  population  (29). 
The  reliability  of  testosterone  was  not  assessed  in  our 
study,  but  estimates  from  the  literature  range  from 
0.74  (30)  to  0.88  (31).  We  were  concerned  that  these 
differences  might  distort  our  results  regarding  the 
relative  importance  of  the  hormones.  We  therefore 
applied  the  method  of  Armstrong  et  al.  (32)  for  cor¬ 
recting  logistic  regression  parameter  estimates  of  con¬ 
tinuous  variables  for  measurement  error  in  case- 
control  data.  For  total  estradiol,  %  estradiol  bound  to 
SHBG,  and  DHEAS,  we  used  within-subject  variances 
which  we  had  previously  estimated  (29).  For  testos¬ 
terone,  we  used  the  within-subject  variance  estimate 
provided  by  Hankinson  et  al.  (Susan  Hankinson, 
Harvard  University  School  of  Public  Health,  personal 
communication,  1996).  We  assumed  that  the  different 
hormonal  variables  had  independent  measurement 
errors. 

We  examined  the  effect  of  Quetelet  index  (weight 
(kg)/height  (m)^)  on  the  androgen-breast  cancer  asso¬ 
ciations,  because  the  rate  of  conversion  of  androgens 
to  estrogens  increases  with  Quetelet  index  (33),  and 
because  the  known  positive  association  of  Quetelet 
index  with  risk  of  breast  cancer  was  confirmed  in  our 
data  (25).  The  effect  of  other  known  risk  factors  (age 
at  menarche,  parity,  age  at  first  full-term  pregnancy, 
age  at  menopause,  history  of  breast  cancer  in  a  first- 
degree  relative,  history  of  a  benign  breast  condition, 
history  of  total  oophorectomy,  lifetime  months  of  lac¬ 
tation,  and  smoking  history)  on  the  androgen-breast 
cancer  associations  was  also  examined  in  multivariate 
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conditional  logistic  analyses.  The  inclusion  of  covari¬ 
ates  other  than  Quetelet  index  in  the  statistical  analy¬ 
ses  did  not  materially  affect  the  results  and  are  there¬ 
fore  not  presented.  In  addition,  the  exclusion  from  the 
analysis  of  the  six  cases  and  13  controls  who  had  a 
total  oophorectomy  prior  to  enrollment  in  the  study 
had  no  material  impact  on  the  results  (data  not  shown). 
Results  are  therefore  presented  including  these  pa¬ 
tients. 

RESULTS 

Some  characteristics  of  the  study  group  are  given  in 
table  1.  The  median  age  at  diagnosis  of  breast  cancer 
was  61.6  years  and  the  median  duration  between  blood 
donation  and  diagnosis  was  2.7  years  (range  0.5-5.5 
years).  Known  breast  cancer  risk  factors  had  a  similar 
distribution  in  this  group  as  in  the  larger  group  on 
which  estrogen  assays  were  carried  out  (25).  There 
were  no  appreciable  differences  between  cases  and 
controls  in  age  at  menarche,  parity,  age  at  menopause, 
and  history  of  prior  oophorectomy.  Delayed  first  full- 
term  pregnancy,  history  of  breast  cancer  in  at  least  one 
first-degree  relative  45  years  old  or  younger  and  his¬ 
tory  of  a  benign  breast  condition  were  associated  with 
a  nonsignificant  increase  in  risk  of  breast  cancer,  while 
a  history  of  breastfeeding  was  associated  with  a  non¬ 
significant  protective  effect.  The  median  weight  and 
median  Quetelet  index  were  significantly  higher  in 
cases  than  in  controls. 

Table  2  shows  the  geometric  mean  levels  of  testos¬ 
terone  and  DHEAS  for  cases  and  controls.  The  mean 
testosterone  level  was  21  percent  higher  in  cases  than 


TABLE  2.  Geometric  mean,  geometric  standard  deviation, 
and  range  of  serum  levels  of  testosterone  and 
dehydroepiandrosterone  sulfate  (DHEAS)  in  breast  cancer 
patients  diagnosed  at  least  6  months  after  blood  donation 
and  their  individually  matched  controls,  New  York  University 
Women’s  Health  Study,  1985-1990 


Hormone 

Cases 
(n  =  85) 

Controls 
(n  =  163) 

Testosterone  (nmol/liter) 

Mean**  (SDf) 

1.05(1.79) 

0.87  (1.89) 

Range 

0,20-3.96 

0.14-6.96 

DHEAS  (nmol/liter) 

Mean*  (SDf) 

2.36  (2.37) 

1.96  (2.26) 

Range 

0.22-14.60 

0.12-10.43 

*  p  =  0.10,  paired  f  test. 
*♦  p  <  0.01,  paired  f  test, 
t  SD,  standard  deviation. 


in  controls  (p  <  0.01)  and  the  mean  DHEAS  level  was 
20  percent  higher  (p  =  0.10). 

Table  3  reports  odds  ratios  for  the  association  be¬ 
tween  breast  cancer  and  serum  levels  of  testosterone, 
total  estradiol,  and  %  estradiol  bound  to  SHBG.  In 
univariate  analyses,  odds  ratios  showed  a  significant 
increase  (p  =  0.02,  test  for  trend)  in  risk  of  breast 
cancer  wiA  increasing  levels  of  testosterone:  the  odds 
ratios  (95  percent  CIs)  for  the  second,  third,  and  fourth 
quartiles  relative  to  the  lowest  quartile,  were  2.4  (1.0- 
5.6),  3.5  (1.4-8.4),  and  2.7  (1. 1-6.8),  respectively. 
However,  adjusting  for  %  SHBG-bound  estradiol, 
which  was  the  estrogen  variable  most  strongly  associ¬ 
ated  with  breast  cancer  risk,  reduced  the  odds  ratios 
and  removed  the  significant  trend.  The  odds  ratios 
(95%  CIs)  were  1.5  (0.6-3.7),  2.0  (0.7-5.2),  and  1.3 


TABLE  1.  Characteristics  of  study  subjects.  New  York  University  Women’s  Health  Study,  1985-1990 


Characteristic 

Cases  (/7  =  85) 

Controls  (n=  163) 

Age  (years)  at  blood  donation,  median  (range) 

Age  ^ears)  at  diagnosis,  median  (range) 

59.2  (48.9-65.4) 

61.6  (52.2-68.6) 

59.1  (48.9-64.9) 

Age  (years)  at  menarche,  median  (range) 

No.  of  full-term  pregnancies  (%) 

13(9-16) 

13(10-17) 

0 

24.7 

23.9 

1 

17.6 

13.5 

>1 

Age  (years)  at  first  full-term  pregnancy,  median 

57.6 

62.6 

(range) 

25  (16-41) 

24  (16-43) 

Ever  breastfeeding  (%) 

20.8 

28.2 

Age  (years)  at  menopause,  median  (range) 

Breast  cancer  In  first-degree  relative  <45  years 

51.7  (31.6-57.2) 

50.9  (24.9-58.6) 

old  (%) 

8.2 

3.7 

Prior  benign  breast  condition  (%) 

57.7 

46.7 

Prior  bilateral  oophorectomy  (%) 

7.0 

8.0 

Height  (cm),  median  (range) 

162.6  (149.9-177.8) 

162.6  (147.3-177.8) 

Weight*  (kg),  median  (range) 

70.3  (47.6-122.5) 

62.6  (45.4-124.7) 

Quetelefs  Index*  (kg/m*),  median  (range) 

26.1  (19.9-43.6) 

24.0  (17.7-44.4) 

•  p  <  0.001,  paired  f  test. 
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TABLE  3.  Odds  ratios  (OR)  and  95%  confidence  intervals  (Cl)  for  the  association  between  breast 
cancer  risk  and  serum  levels  of  testosterone,  total  estradiol,  and  %  estradiol  bound  to  sex  hormone¬ 
binding  globulin  (SHBG),  New  York  University  Women’s  Health  Study,  1985-1990 


Hormonal  variable 
by  quartiles 

Unadjusted 

ORt 

95%  Cl 

Adjusted 

ORt 

95%  Cl 

Adjusted 

OR§ 

95%  Cl 

Testosterone# 

1 

1.0 

1.0 

1.0 

2 

2.4 

1. 0^5.6 

1.5 

0.6-3.7 

1.4 

0.6-3.5 

3 

3.5 

1.4^.4 

2.0 

0.7--5.2 

1.8 

0.7-5.0 

4 

27 

1.1-6.8 

1.3 

0.5--3.7 

1.2 

0.4-3.5 

p  for  trend 

♦ 

NSH 

NS 

Total  Gstradioitt 

1 

1.0 

1.0 

1.0 

2 

2.0 

0.8-5.3 

1.8 

0.7-4.8 

1.7 

0.6-^.7 

3 

4.3 

1.8-10.4 

3.6 

1. 4-9.0 

2.6 

1.0-^.8 

4 

3.8 

1.5-10,3 

2.9 

1. 0-8.3 

1.6 

0.5— 5.8 

p  for  trend 

*** 

♦ 

NS 

%  SHBG-bound  estradiolJt 

1 

1.0 

1.0 

1.0 

2 

0.43 

0.19-0.98 

0.44 

0.19-1.01 

0.44 

0.19-1.05 

3 

0.19 

0.07-0.49 

0.20 

0.07-0.56 

0.21 

0.07-0.59 

4 

0.05 

0.01-0.17 

0.05 

0.01-0.19 

0.05 

0.01’-0.21 

p  for  trend 

*** 

*p<o.o5:**p  <o.oi:***p<o.ooi. 

t  Unacflusted,  except  for  matching  factors  (age  and  serum  storage  time). 

^  For  testosterone,  odds  ratios  are  adjusted  for  %  SHBG-bound  estradiol;  for  total  estradiol  and  %  SHBG- 
bound  estradiol,  odds  ratios  are  adjusted  for  testosterone. 

§  Adjusted  for  other  hoimonal  variables  in  the  table. 

11  NS,  not  significant. 

#  The  cut-points  defining  quartiles  of  testosterone  were  0.73, 1.02,  and  1.45  nmol/mL 
tt  The  cut-points  defining  quartiles  of  total  estradiol  were  20, 30,  and  45  pg/mL. 
ii  The  cut-points  defining  quartiles  of  %  SHBG-bound  estradiol  were  34.4,  43.6,  and  51.3%. 


(0.5-3.7)  for  second,  third,  and  fourth  quartiles,  re¬ 
spectively.  Adding  total  estradiol  to  the  model  includ¬ 
ing  testosterone  and  %  SHBG-bound  estradiol  did  not 
significantly  improve  the  fit  of  the  model,  although  it 
further  reduced  the  odds  ratios  (95  percent  CIs)  to  1.4 
(0.6-3.5),  1.8  (0.7-5.0),  and  1.2  (0.4-3.5),  respec¬ 
tively.  Adding  %  free  estradiol  or  Quetelet  index  to  the 
model  containing  testosterone,  estradiol,  and  %  estra¬ 
diol  bound  to  SHBG  did  not  materially  affect  the  odds 
ratios  (data  not  shown).  A  strong  positive  association 
between  breast  cancer  risk  and  increasing  levels  of 
total  estradiol  was  also  present  in  univariate  analysis. 
This  association  remained  significant  after  adjusting 
for  testosterone  levels,  although  the  odds  ratios  and  the 
corresponding  p  values  were  somewhat  reduced.  The 
protective  effect  associated  with  increasing  percentage 
of  SHBG-bound  estradiol  was  hardly  affected  by  ad¬ 
justment  for  testosterone  levels.  In  the  model  includ¬ 
ing  the  three  hormonal  variables,  only  %  estradiol 
bound  to  SHBG  remained  significant.  Analysis  on 
continuous  variables  showed  similar  results. 

Results  of  the  analyses  correcting  for  measurement 
error  were  similar  to  results  of  the  uncorrected  analy¬ 
ses  with  respect  to  the  relative  strength  of  the  associ¬ 
ations  of  the  hormonal  variables  with  breast  cancer 
risk:  the  positive  association  of  testosterone  was  weak¬ 


ened  and  no  longer  significant  after  adjusting  for  % 
SHBG-bound  estradiol,  whereas  the  positive  associa¬ 
tion  of  total  estradiol  became  only  marginally  signif¬ 
icant  and  the  negative  association  of  %  SHBG-bound 
estradiol  remained  highly  significant  after  adjusting 
for  testosterone.  In  the  model  including  the  three  vari¬ 
ables,  only  %  SHBG-bound  estradiol  remained  signif¬ 
icant. 

Table  4  reports  odds  ratios  for  the  association  be- 

TABLE  4.  Odds  ratios  (OR)  and  95%  confidence  Intervals 
(Cl)  for  the  association  between  breast  cancer  risk  and  serum 
levels  of  dehydroeplandrosterone  sulfate  (DHEAS),  New  York 
University  Women’s  Health  Study,  1985-1990 


Quartfles  of 
OHEASt 

OHt 

95%  Cl 

OR§ 

95%  Cl 

1 

1.0 

1.0 

2 

0.7 

0.3-1 .5 

0.3 

0.1-0.9 

3 

1.0 

0.5-2.1 

0.5 

0.2-1 .3 

4 

1.6 

0.7-3.5 

0.9 

0.4-2.3 

p  for  trend 

NSH 

NS 

t  The  cut-points  defining  quartiles  of  DHEAS  wore  1 .33,  2.38, 
and  3.58  pmol/liter. 

4  Unadjusted,  except  for  matching  factors  (age  and  senim 
storage  time). 

§  Adjusted  for  %  estradiol  bound  to  sex  hormone-binding 
globulin  and  total  estradiol. 

H  NS,  not  significanL 
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tween  breast  cancer  risk  and  increasing  levels  of 
DHEAS.  In  unadjusted  analyses,  although  the  odds 
ratio  in  the  highest  quartile  was  slightly  elevated  (odds 
ratio  =  1.6,  95  percent  Cl  0.7-3.5),  there  was  no  trend 
of  increasing  risk  of  breast  cancer  with  increasing 
levels  of  DHEAS.  The  inclusion  of  estrogen  variables 
or  of  Quetelet  index  did  not  result  in  a  significant 
trend.  The  odds  ratios  for  the  association  between 
breast  cancer  risk  and  DHEAS  are  shown  adjusting  for 
%  SHBG-bound  estradiol  and  total  estradiol.  Inclusion 
of  DHEAS  in  models  containing  estrogen  variables 
did  not  materially  affect  the  associations  between  es¬ 
trogen  variables  and  breast  cancer  risk  (data  not 
shown).  Correcting  for  measurement  error  in  the  hor¬ 
monal  variables  did  not  alter  the  results. 

Finally,  analyses  were  conducted  using  only  the  56 
matched  sets  with  at  least  2  years  between  blood 
donation  and  diagnosis  of  the  case.  The  results  were 
similar  to  the  results  of  analyses  conducted  in  the 
larger  group,  both  for  testosterone  and  DHEAS  (data 
not  shown). 

Table  5  reports  the  Spearman  correlation  coeffi¬ 
cients  for  hormone  levels  and  Quetelet  index,  by  case- 
control  status.  Note  that  testosterone  was  correlated 
positively  with  total  estradiol  (r^  =  0.23  in  cases  and 
0.27  in  controls)  and  negatively  with  %  estradiol 
bound  to  SHBG  {r^  =  —0.27  in  cases  and  —0.33  in 
controls). 

DISCUSSION 

In  unadjusted  analyses  (except  for  matching  vari¬ 
ables),  we  observed  a  statistically  significant  trend  of 
increasing  risk  of  breast  cancer  with  increasing  serum 
levels  of  testosterone  in  postmenopausal  women.  Be¬ 
cause  all  cases  were  diagnosed  at  least  6  months  after 


blood  donation  (median  2.7  years)  and  because  a  sim¬ 
ilar  trend  was  observed  when  the  analysis  was  limited 
to  the  two-thirds  of  the  cases  diagnosed  at  least  2  years 
after  blood  donation,  it  seems  unlikely  that  the  higher 
levels  of  testosterone  observed  in  women  who  subse¬ 
quently  developed  the  disease  compared  with  controls 
resulted  from  the  presence  of  tumors. 

Three  previous  prospective  studies  (13-15)  have 
examined  the  association  between  serum  levels  of 
testosterone  and  breast  cancer  risk  in  postmenopausal 
women.  No  association  was  observed  in  the  Rancho 
Bernardo,  California,  study  (15),  in  which  the  age- 
adjusted  mean  testosterone  levels  were  258  pg/ml  in 
15  cases  diagnosed  at  least  one  year  after  blood  dona¬ 
tion  and  261  pg/ml  in  4(X)  noncases.  However,  results 
from  the  two  other  prospective  studies  (13,  14)  are 
consistent  with  ours  findings.  In  the  Washington 
County,  Maryland,  study  (14),  serum  levels  were  11 
percent  higher  in  39  cases  (mean  304  pg/ml)  than  in 
155  controls  (mean  274  pg/ml),  although  this  differ¬ 
ence  was  not  statistically  significant.  Finally,  in  24 
cases  diagnosed  during  the  first  3.5  years  of  follow-up 
of  a  cohort  of  4,040  postmenopausal  women  from 
northern  Italy,  the  risk  ratios  (95  percent  CIs)  for 
breast  cancer  associated  with  the  second  and  third 
tertiles  of  testosterone  were  4.8  (0.9-25.1)  and  7.0 
(1.4-36.4),  respectively  (p  for  trend  =  0.026)  (13). 

We  recently  reported  a  positive  association  between 
bioavailable  estrogens  and  subsequent  risk  of  breast 
cancer  in  a  slightly  larger  group  of  postmenopausal 
women  from  the  NYU  Women’s  Health  Study  (25). 
An  objective  of  the  present  analysis  was  to  examine 
whether  androgens  had  an  effect  on  breast  cancer  risk 
that  was  independent  of  their  influence  on  serum 
levels  and  biologic  availability  of  estrogen.  Results 


TABLE  5.  Spearman  correlation  coefficients  for  androgen  and  estrogen  levels  and  Quetelet  Index,  New 
York  University  Women’s  Health  Study,  1985-1990 


DHEASt 

Total 

estradiol 

% 

SHBGt- 

bound 

estradiol 

% 

free 

estmdioi 

Quetelet 

Index 

Controls  (n  - 163) 

Testosterone 

0.35*** 

0.27*** 

-0.33*** 

0.25** 

0.30*** 

DHEAS 

0.23** 

-0.27*** 

0.25** 

0.05 

Total  estradiol 

-0.48*** 

0.45‘»» 

0.43*** 

%  SHBG-bound  estradiol 

-0.72*** 

-0.52*** 

%  free  estradiol 

0.48*** 

Cases  (n  =  85) 

Testosterone 

0.38*** 

0.23* 

-0.27* 

0.12 

0.11 

DHEAS 

0.28* 

-0.37*** 

0.24* 

-0.06 

Total  estradiol 

-0.47*** 

0.19 

0.38*** 

%  SHBG-bound  estradiol 

-0.56*** 

-0.42*** 

%  free  estradiol 

0.29** 

•  p  <  0.05;  ••  p  <  0.01;  p  <  0.001 . 

t  DHEAS,  dehydroepiandrosterone  sulfate;  SHBG,  sex  hormone-binding  giobulin. 
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showed  that,  after  including  estrogen  variables  (% 
SHBG-bound  estradiol  and  total  estradiol)  in  our  sta¬ 
tistical  model,  the  odds  ratios  associated  with  higher 
levels  of  testosterone  were  considerably  reduced,  and 
there  was  no  longer  a  significant  trend.  A  similar 
result  was  recently  observed  in  the  reanalysis  of  a 
population-based  case-control  study  conducted  in 
Sweden  (9,  34).  Whereas,  in  univariate  analysis,  a 
significant  positive  association  was  found  between 
testosterone  and  breast  cancer  risk,  the  association 
disappeared  after  controlling  for  estrone  (and  andro- 
stenedione).  On  the  other  hand,  Berrino  et  al.  (13)  did 
not  observe  a  reduction  of  the  association  between 
breast  cancer  risk  and  levels  of  testosterone  when  they 
adjusted  for  total  estradiol.  However,  multivariate 
analysis  was  hampered  by  the  small  sample  size  of  the 
study  (24  cases). 

We  were  concerned  about  the  impact  of  measure¬ 
ment  error  in  the  hormonal  variables  on  our  results.  It 
is  well  known  that  in  the  absence  of  confounders, 
nondifferential  measurement  error  in  an  exposure  vari¬ 
able  will  result  in  an  attenuation  of  the  true  exposure/ 
disease  relation.  When  several  variables  are  measured 
with  error,  however,  the  associations  of  these  variables 
with  disease  in  a  multivariate  model  may  be  weaker  or 
stronger  than  the  true  associations  (35).  It  is  reassuring 
that,  in  our  analysis,  correcting  for  measurement  error 
did  not  affect  the  relative  strength  of  the  associations 
of  the  hormonal  variables  with  risk  of  breast  cancer. 

Our  results  are  consistent  with  the  hypothesis  that 
testosterone  has  an  indirect  effect  on  breast  cancer 
risk,  through  its  association  with  estrogen  levels.  The 
fact  that  %  SHBG-bound  estradiol  was  the  estrogen 
variable  which  caused  the  greatest  reduction  in  the 
testosterone-breast  cancer  odds  ratios  suggests  that  the 
effect  of  testosterone  on  the  bioavailability  of  estro¬ 
gens  may  be  more  important  than  its  role  as  a  precur¬ 
sor  of  estrogens.  An  increase  in  the  serum  levels  of 
testosterone  could  lead  to  a  decrease  in  %  estradiol 
bound  to  SHBG,  because  testosterone  binds  to  SHBG 
with  greater  affinity  than  estradiol.  However,  the  mod¬ 
eling  studies  performed  by'Dunn  et  al.  (36)  as  well  as 
in  vitro  experiments  (37)  indicate  that  higher  concen¬ 
trations  of  testosterone  would  be  required  to  observe 
such  an  effect.  Inhibition  of  the  hepatic  secretion  of 
SHBG  by  testosterone  could  also  result  in  a  decrease 
in  %  SHBG-bound  estradiol,  because  small  changes  in 
SHBG  concentration  can  produce  an  important  reduc¬ 
tion  in  the  percentage  of  hormone  bound  to  this  protein 
(38).  In  support  of  this  hypothesis,  a  moderate  nega¬ 
tive  correlation  between  testosterone  and  SHBG  has 
been  reported  by  some  studies  (7,  39, 40)  although  not 
all  studies  (34). 


A  limitation  of  our  study  is  that  only  total  testoster¬ 
one  was  measured.  The  free  and  albumin-bound  hor¬ 
mone  fractions  might  be  more  relevant  biologically 
because  these  fractions  are  thought  to  diffuse  readily 
into  the  cells  (41).  Indeed,  with  regard  to  estrogen,  the 
variable  most  strongly  related  to  risk  of  breast  cancer 
was  the  %  SHBG-bound  estradiol,  which  had  a  pro¬ 
tective  effect.  Thus,  we  cannot  exclude  the  possibility 
that  the  free  and  albumin-bound  fractions  of  testoster¬ 
one  might  have  an  independent  effect  on  breast  cancer 
risk. 

The  lack  of  an  association  between  DHEAS  and 
breast  cancer  observed  here  is  consistent  with  the 
results  of  the  previous  prospective  studies  which  ex¬ 
amined  the  role  of  this  hormone  in  postmenopausal 
women.  Barrett-Connor  et  al.  (23)  measured  DHEAS 
levels  in  a  cohort  of  534  women,  50-79  years  old, 
among  whom  21  subsequently  developed  breast  can¬ 
cer,  and  reported  no  difference  between  cases  and 
non-cases.  In  a  case-control  study  nested  within  a 
cohort  of  approximately  13,000  female  residents  of 
Washington  County,  Maryland,  Gordon  et  al.  (24) 
reported  that  serum  levels  of  DHEA  were  significantly 
higher  in  30  postmenopausal  women  who  developed 
breast  cancer  9  years  or  more  after  blood  donation  than 
in  59  matched  controls.  However,  no  statistically  sig¬ 
nificant  difference  in  DHEAS  levels  was  observed, 
although  serum  levels  of  DHEAS  were  slightly  higher 
in  the  women  who  developed  breast  cancer  than  in  the 
controls.  Finally,  an  increase  in  the  odds  ratios  for 
breast  cancer  was  observed  with  increasing  serum 
levels  of  DHEAS  in  an  Italian  cohort  study,  the  Study 
of  Hormones  and  Diet  in  the  Etiology  of  Breast  Cancer 
(ORDET  Study),  but  this  trend  was  not  statistically 
significant  (13).  Overall,  there  is  little  epidemiologic 
evidence  that  DHEAS  plays  an  important  role  in  breast 
cancer  development  in  postmenopausal  women. 

In  conclusion,  elevated  serum  levels  of  testosterone 
were  found  to  be  associated  with  subsequent  risk  of 
breast  cancer  in  postmenopausal  women.  However, 
this  association  was  considerably  reduced  and  no 
longer  significant  after  taking  into  account  the  effect 
of  serum  estrogen  levels  on  breast  cancer  risk,  sug¬ 
gesting  that  androgens  act  through  their  influence  on 
the  availability  of  estrogens  via  SHBG  binding  and/or 
as  precursors  of  estrogens.  There  was  no  evidence  in 
our  study  that  the  adrenal  androgen  DHEAS  plays  a 
role  in  breast  cancer  development.  In  light  of  these 
results,  additional  research  to  identify  factors  influenc¬ 
ing  testosterone  levels  in  healthy  postmenopausal 
women  would  be  of  interest.  Among  life-style  factors 
such  as  smoking,  obesity,  diet,  alcohol  consumption, 
and  exercise,  only  obesity  has  been  found  to  be  mar- 
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ginally  associated  with  higher  levels  of  testosterone 
(42,  43). 
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